ABSTRACT

We present the results from a damped Lyα survey of the Sloan Digital Sky Survey, Data Release 3.
We have discovered over 500 new damped Lyα systems at z > 2.2, and the complete statistical sample
for z > 1.6 has more than 600 damped Lyα galaxies. We measure the H I column density distribution
f_HI(N, X) and its zeroth and first moments (the incidence ℓ_DLA and gas mass density Ω_g^DLA of damped
Lyα systems, respectively) as a function of redshift. The key results include: (1) the full SDSS-DR3
f_HI(N, X) distribution (z ~ 3.06) is well fit by a Γ-function (or double power-law) with 'break' column
density N_γ = 10^(21.5±0.1) cm^-2 and 'faint-end' slope α = -1.8 ± 0.1; (2) the shape of the f_HI(N, X)
distributions in a series of redshift bins does not show evolution; (3) the incidence and gas mass density
of damped systems decrease by 35 ± 9% and 50 ± 10% during ≈ 1 Gyr between the redshift intervals
z = [3.0, 3.5] and z = [2.2, 2.5]; and (4) the incidence and gas mass density of damped Lyα systems in the
lowest SDSS redshift bin (z = 2.2) are consistent with the current values. We investigate a number of
systematic errors in damped Lyα analysis and identify only one important effect: we measure 40 ± 20%
higher Ω_g^DLA values toward a subset of brighter quasars than toward a faint subset. This effect is contrary
to the bias associated with dust obscuration and suggests that gravitational lensing may be important.
Comparing the results against several models of galaxy formation in ΛCDM, we find all of the models
significantly underpredict ℓ_DLA at z = 3 and only SPH models with significant feedback (Nagamine et
al.) may reproduce Ω_g^DLA at high redshift. Based on our results for the damped Lyα systems, we argue
that the Lyman limit systems contribute ≈ 33% of the universe's H I atoms at all redshifts z = 2 to 5.
Furthermore, we infer that the f_HI(N, X) distribution for N_HI < 10^20 cm^-2 has an inflection with slope
d log f / d log N > -1. We advocate a new mass density definition, the mass density of predominantly
neutral gas Ω_g^Neut, to be contrasted with the mass density of gas associated with H I atoms. We contend
the damped Lyα systems contribute > 80% of Ω_g^Neut at all redshifts and therefore are the main reservoirs
for star formation.



Subject headings: galaxies: evolution — intergalactic medium — quasars: absorption lines 



1. INTRODUCTION 

The damped Lyα systems are the class of quasar absorption line
systems with H I column density N_HI ≥ 2 × 10^20 cm^-2 (see Wolfe,
Gawiser, & Prochaska 2005, for a review). Unlike the Lyα forest, the
damped Lyα systems consist of predominantly neutral gas and are
proposed to be the progenitors of galaxies like the Milky Way (e.g.
Kauffmann 1996). Wolfe et al. (1986) established the
N_th = 2 × 10^20 cm^-2 threshold primarily to correspond to the surface
density limit of local 21 cm observations at that time. It is somewhat
fortuitous that this threshold roughly corresponds to the transition
from primarily ionized gas to predominantly neutral gas (e.g. Viegas
1995; Prochaska & Wolfe 1996; Prochaska 1999; Vladilo et al. 2001).

For the past two decades, several groups have surveyed high z quasars
for the damped Lyα systems (Wolfe et al. 1986, 1995; Storrie-Lombardi
et al. 1996a; Storrie-Lombardi & Wolfe 2000; Peroux et al. 2003).
These surveys measured the H I frequency distribution function and its
moments: the incidence of the damped Lyα systems ℓ_DLA and the gas
mass density of these galaxies Ω_g^DLA. The latter quantity has
cosmological significance. Its evolution constrains the build-up of
structure within hierarchical cosmology (e.g. Ma & Bertschinger 1994;
Klypin et al. 1995), it serves as the important neutral gas reservoir
for star formation at high redshift and describes the competition
between gas accretion and star formation (e.g. Fall & Pei 1993), and
it constrains models of galaxy formation in hierarchical cosmology
(e.g. Somerville, Primack, & Faber 2001; Cen et al. 2003; Nagamine,
Springel, & Hernquist 2004). Previous surveys have reported
statistical error on Ω_g^DLA of ≈ 30% in redshift intervals Δz ≈ 0.5
at high redshift. As we enter the so-called era of precision
cosmology, we aspire to constrain Ω_g^DLA to better than 10%.
Although not formally a cosmological parameter, a precise
determination of Ω_g^DLA and its redshift evolution is a fundamental
constraint on any cosmological theory of galaxy formation.

In Prochaska & Herbert-Fort (2004), hereafter PH04, we initiated a
survey for the damped Lyα systems in the quasar spectra of the Sloan
Digital Sky Survey (SDSS) Data Release 1 (DR1). We demonstrated that
the spectral resolution, signal-to-noise ratio (SNR), and wavelength
coverage of the SDSS spectra are well suited to survey the damped Lyα
systems at z > 2.2. We reported on the number of damped Lyα systems
per unit redshift and the neutral gas mass density. In this paper, we
extend the survey to include the full Data Release 3. In addition to
substantially increasing the PH04 sample, this paper extends the
analysis to include a determination of the N_HI frequency
distribution, f_HI(N, X). Furthermore, we perform a series of tests to
examine systematic error related to the analysis. With the increased
sample size, we believe the systematic uncertainty is as important as
statistical uncertainty. Similarly, uncertainty related to selection
bias (e.g. dust obscuration) must be given careful attention. Future
progress will require significant strides on each of these three
fronts.

Table 1

SDSS-DR3 QUASAR SAMPLE

Plate  MJD    FiberID  Name                 z_qso  f_BAL^a  z_i    z_i^b  z_f    z_candidate
650    52143  178      J000050.60-102155.8  2.640  ...      2.200  2.200  2.604  ...
750    52235  550      J000143.41+152021.4  2.638  ...      2.200  2.200  2.602  ...
650    52143  519      J000159.12-094712.4  2.308  ...      2.203  2.250  2.275  ...
387    51791  556      J000221.11+002149.4  3.057  ...      2.200  2.209  3.016  1.958
650    52143  111      J000238.41-101149.8  3.941  ...      3.203  ...    3.891  3.271,3.523,3.605,3.655
750    52235  608      J000300.34+160027.7  3.675  ...      3.285  3.480  3.629  ...
650    52143  48       J000303.34-105150.6  3.647  2        ...    ...    ...    2.897,3.465
750    52235  36       J000335.21+144743.6  3.484  ...      ...    ...    ...    ...
650    52143  604      J000413.63-085529.5  2.424  ...      2.200  2.200  2.389  ...
650    52143  605      J000424.16-085047.9  2.433  ...      2.200  2.200  2.399  ...

a 0=No intrinsic absorption; 1=Mild intrinsic absorption, included in analysis with restriction; 2=Strong intrinsic absorption, excluded.
b Starting redshift for SNR_lim = 5.

Note: [The complete version of this table is in the electronic edition of the Journal. The printed edition contains only a sample]

This paper is organized as follows: § 2 describes the SDSS quasar
sample and defines the subset used to survey the damped Lyα systems;
we present the damped Lyα candidates and the N_HI measurements in § 3;
we perform standard analysis of the H I distribution in § 4; and we
discuss the results in § 6. Aside from comparisons against local
observations, we mainly restrict the analysis to optical surveys for
the damped Lyα systems (i.e. z > 1.6). Unless otherwise stated, all
log values and expressions correspond to log base 10. Here and
throughout the paper we adopt cosmological parameters consistent with
the latest WMAP results (Bennett et al. 2003): Ω_Λ = 0.7, Ω_m = 0.3,
H_0 = 70 km s^-1 Mpc^-1.

2. SDSS SAMPLE 

2.1. Redshift Path 

The quasar sample considered in this paper includes every object
identified spectroscopically as a quasar with z > 2.2 in the SDSS-DR3
(see footnote 1). We also include the full compilation of quasars and
damped Lyα systems (avoiding duplication) from the previous two
decades of research as compiled by Peroux et al. (2003). In each of
these SDSS quasars we have defined a redshift interval where we search
for the presence of damped Lyα systems. In this paper, we define the
redshift path using an algorithm similar to that introduced in PH04.
We also apply more conservative criteria to investigate the effects of
SNR (the estimated flux divided by estimated noise).

1 Specifically, where SPECCLASS=3 or 4 in SPECPHOTO for SDSS-DR3.

The starting redshift z_i is defined as follows. First, we identify
the minimum wavelength λ_i where the median SNR in a 20 pixel interval
exceeds SNR_lim. In PH04, we took SNR_lim = 4 and in this paper we
also consider larger values. An assumption of our prescription is that
the median SNR does not decrease significantly at wavelengths greater
than λ_i unless a damped Lyα candidate is present. This is a good
assumption unless λ_i coincides with the Lyβ/O VI emission feature of
the quasar. At these wavelengths, the SNR of the data is elevated and
it is possible the median SNR may be above SNR_lim in this spectral
region but below SNR_lim otherwise. Therefore, in the 10000 km s^-1
region centered at λ = 1025.7(1 + z_qso) Å we demand that the median
SNR be greater than 3 × SNR_lim if λ_i is to be set within that
region. In practice, this removes a number of faint quasars from the
sample whose spectra have median SNR greater than SNR_lim only in the
Lyβ/O VI emission line. We then define the starting redshift for the
damped Lyα search:

$$ z_i = (\lambda_i / 1215.67) - 1 + 0.005 , \qquad (1) $$

where the 0.005 increment offsets z_i by 1500 km s^-1. Finally, we
demand z_i ≥ 2.2.
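The prescription for z_i can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the survey code: the function name `starting_redshift` is hypothetical, λ_i is taken to be the bluest pixel of the first 20-pixel window whose median SNR exceeds SNR_lim, and the stricter 3 × SNR_lim requirement near the Lyβ/O VI emission feature is deliberately left out.

```python
from statistics import median

LYA_REST = 1215.67  # rest wavelength of Lyα in Angstroms


def starting_redshift(wave, snr, snr_lim=4.0, window=20, z_min=2.2):
    """Return z_i from Eq. (1), or None if no window exceeds snr_lim.

    wave : pixel wavelengths in Angstroms, increasing
    snr  : per-pixel signal-to-noise ratio (flux / noise)
    Assumption: lambda_i is the first pixel of the first qualifying window.
    """
    for start in range(len(wave) - window + 1):
        if median(snr[start:start + window]) > snr_lim:
            z_i = wave[start] / LYA_REST - 1.0 + 0.005
            # The survey only searches at z >= 2.2.
            return max(z_i, z_min)
    return None
```

Raising `snr_lim` to 5 or 8 corresponds to the more conservative cuts considered later in the paper; a sightline that returns `None` contributes no redshift path.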

Similarly, we define an ending redshift for the statistical pathlength
of a given quasar,

$$ z_f = 0.99 z_{\rm qso} - 0.01 , \qquad (2) $$

which corresponds to 3000 km s^-1 blueward of the Lyα emission
feature. The offset minimizes the likelihood that a damped Lyα system
is physically associated with the quasar. For reasons unknown to us, a
small fraction of the quasar spectra have extended regions
(> 50 pixels) with zero flux and null values in their error array. In
these cases, we set z_f to 1000 km s^-1 blueward of the 'null region'.
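As a quick check of Eq. (2), the short sketch below evaluates z_f and the implied velocity offset from the quasar. The function names are hypothetical and the velocity uses the simple approximation v = c Δz / (1 + z_qso). Because z_qso - z_f = 0.01 (1 + z_qso), the offset is 0.01c, about 3000 km s^-1, at every redshift.

```python
C_KMS = 299792.458  # speed of light in km/s


def ending_redshift(z_qso):
    """Ending redshift of the search path, Eq. (2)."""
    return 0.99 * z_qso - 0.01


def velocity_offset_kms(z_qso):
    """Velocity separation between z_f and z_qso, v = c * dz / (1 + z_qso)."""
    z_f = ending_redshift(z_qso)
    return C_KMS * (z_qso - z_f) / (1.0 + z_qso)
```

For the first quasar in Table 1 (z_qso = 2.640) this gives z_f = 2.6036, which matches the tabulated 2.604; small differences for other rows are expected because the tabulated z_qso values are rounded.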

Each quasar spectrum was visually inspected and characterized
according to the presence or absence of features identified with
intrinsic absorption (e.g. BAL quasars). This is a necessary step in
damped Lyα surveys because quasars with strong intrinsic N V and O VI
absorption can be confused with the Lyα transition of an intervening
quasar absorption line system. Following PH04, we divided the quasars
into three categories: (a) quasars without significant intrinsic
absorption; (b) mild absorption-line quasars which show modest
absorption at the C IV emission feature; and (c) strong
absorption-line quasars whose C IV, O VI lines have large equivalent
widths and could be confused with a damped Lyα system. The latter
category is discarded from all subsequent analysis. For the mild
absorption-line quasars, however, we search for damped Lyα systems in
the redshift interval max(z_i, z_BAL) < z < min(z_qso - 0.08226, z_f),
where z_BAL = (1 + z_qso)(1060/λ_Lyα) - 1 and the modification to the
ending redshift minimizes the likelihood of misidentifying N V
absorption as a damped Lyα system.

Table 1 presents the full list of SDSS-DR3 quasars analyzed here.
Columns 1-9 designate the SDSS plate, MJD, and fiber numbers, the
quasar name, the emission redshift, a flag describing intrinsic
absorption, z_i and z_f for SNR_lim = 4, and the absorption redshift
of any damped Lyα candidates along the sightline. The latter may
include candidates where z_abs is not in the [z_i, z_f] interval.
Also, many candidates are false positive detections, in particular
BAL absorption lines.

Fig. 1. Redshift sensitivity function g(z) as a function of redshift
for the LBQS survey, the compilation of Peroux et al. (2003), the
SDSS-DR1 sample from PH04, the SDSS-DR3 sample studied here,
and the total sample. The gray dashed-line traces the ratio of the
SDSS-DR3 sample to the previous 20 year compilation (Peroux et
al. 2003).



2.2. g(z) 

The survey size of a quasar absorption line study is characterized
through the redshift sensitivity function g(z). Figure 1 presents a
series of g(z) curves for the Large Bright Quasar Survey (LBQS) search
for damped Lyα systems (Wolfe et al. 1995), the compilation of Peroux
et al. (2003), and the SDSS-DR1 sub-sample of PH04. These are compared
against the full SDSS-DR3 sample with SNR_lim = 4. Note that the SDSS
data releases are inclusive and also that the Peroux et al. (2003)
compilation includes the LBQS sample. Comparing the curves, we find
the current SDSS sample now exceeds the previous surveys by an order
of magnitude at z ~ 3, several times at z ~ 4, and more than a factor
of 10 at z > 4.6 (the gray dashed-line traces the ratio of SDSS-DR3 to
the previous surveys). The dip at z ~ 4 in the ratio of SDSS-DR3 to
the previous surveys is largely due to the impressive high z survey
carried out by the Cambridge group. We also stress that the increased
sensitivity of the SDSS-DR3 at z > 4 should be viewed conservatively;
follow-up observations should be performed to confirm the high
redshift results. In Figure 2 we show the g(z) curves for several cuts
on SNR_lim and sample selection for the SDSS-DR3 quasars. These cuts
are summarized in Table 2 and will be referred to throughout the text.

Fig. 2. Redshift sensitivity function g(z) as a function of redshift
for several cuts of the SDSS-DR3 data (Table 2).


In contrast with the g(z) curves for the Lyα forest or
Mg II systems (e.g. Prochter, Prochaska, & Burles 2005),
one notes no significant features due to strong sky lines.
This is because the sky lines can only lead to false positive
detections in our algorithm. These are easily identified
and ignored.



Table 2

SDSS CUTS

Label         SNR_lim  CLR^a
SDSS-DR3_4    4        no     525
SDSS-DR3_4C   4        yes    340
SDSS-DR3_5    5        no     395
SDSS-DR3_5C   5        yes    183
SDSS-DR3_8    8        no     155
SDSS-DR3_8C   8        yes    88

a This entry refers to whether the quasar sample adheres to the color criteria d(
color-cut by only including quasars drawn from plates 761 and greater.

3. N_HI MEASUREMENTS
3.1. Damped Lyα Candidates

Damped Lyα candidates were identified in the quasar spectra using the
same prescription introduced by PH04. In short, the algorithm compares
the SNR of the pixels in a running window of width 6(1 + z) Å against
x_SNR, specifically defined to be the median SNR of the 151 pixels
starting 200 pixels redward of the Lyα emission peak divided by 2.5,
i.e. x_SNR gauges the SNR of the quasar continuum just redward of Lyα
emission. Furthermore, x_SNR is restricted to have a minimum value of
1 and a maximum value of 2. If the fraction of pixels with
SNR > x_SNR is < 60%, a damped Lyα candidate is recorded at the center
of the window. This candidate list is supplemented by systems
associated with relatively strong metal-line transitions outside the
Lyα forest (see Herbert-Fort et al. 2005, in preparation). Finally,
the list is further supplemented by our visual inspection of each
quasar spectrum when characterizing its intrinsic absorption. The
damped Lyα candidates are listed in Table 1.
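The running-window test described above can be sketched as follows. This is an illustrative reading of the prescription rather than the PH04 implementation: the function names are hypothetical, the window is taken to be symmetric about each pixel blueward of the Lyα emission peak, and every pixel that satisfies the criterion is flagged (merging adjacent flagged pixels into a single candidate is left out).

```python
from statistics import median

LYA_REST = 1215.67  # rest wavelength of Lyα in Angstroms


def snr_threshold(snr, peak_index):
    """x_SNR: median SNR of the 151 pixels starting 200 pixels redward of
    the Lyα emission peak, divided by 2.5 and restricted to [1, 2]."""
    reference = snr[peak_index + 200:peak_index + 351]
    return min(2.0, max(1.0, median(reference) / 2.5))


def flag_pixels(wave, snr, peak_index, max_good_frac=0.60):
    """Return (x_SNR, indices of window centers that qualify as candidates)."""
    x_snr = snr_threshold(snr, peak_index)
    flagged = []
    for i in range(peak_index):
        # Window width is 6(1+z) Angstroms with 1+z = wave/LYA_REST,
        # so the half-width is 3 * wave / LYA_REST.
        half_width = 3.0 * wave[i] / LYA_REST
        window = [s for w, s in zip(wave, snr) if abs(w - wave[i]) <= half_width]
        good = sum(1 for s in window if s > x_snr)
        if good / len(window) < max_good_frac:
            flagged.append(i)
    return x_snr, flagged
```

On a spectrum with a broad, saturated absorption trough, the flagged indices cluster around the trough center, and the candidate redshift follows from `wave[i] / LYA_REST - 1`.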

In PH04, we reported that Monte Carlo tests of our automated algorithm
on synthetic spectra implied an idealized damped Lyα completeness
~ 95% for N_HI ~ 2 × 10^20 cm^-2 and 100% for log N_HI > 20.4. Given
the supplemental candidates from the metal-line search and our visual
inspection, we believe the completeness definitely exceeds 95% for all
absorbers with N_HI ≥ 2 × 10^20 cm^-2. We discuss a new set of
completeness tests below.

Every damped Lyα candidate was subjected to the following analysis.
First, the Lyα profile was visually inspected and obvious
false-positive candidates were eliminated. This visual analysis
included overplotting a Lyα profile with N_HI = 2 × 10^20 cm^-2. To
minimize the labor of fitting Lyα profiles, we chose not to fit many
systems where the overplotted profile clearly was a poor solution. It
was our experience that nearly all of these candidates have best-fit
values N_HI < 1.5 × 10^20 cm^-2. Second, we searched for metal-line
absorption at redshifts near the estimated redshift centroid of the
Lyα profile. The extensive wavelength coverage of SDSS spectra is a
tremendous advantage in the analysis of damped Lyα candidates. In
general, we focused on the strongest low-ion transitions observed in
the damped Lyα systems (e.g. Prochaska et al. 2003a): Si II λ1260,
1304, 1526, O I λ1302, C II λ1334, Al II λ1670, and Fe II λ1608, 2382,
2600. We characterized the metal-line absorption for each candidate
as: (a) no metals detected (see footnote 2); (b) weak or ambiguous
metal absorption; and (c) secure metal-line absorption. Damped Lyα
candidates in the latter category generally exhibit two or more
metal-line features outside the Lyα forest. For damped Lyα systems
with secure metal-line absorption, we constrain the subsequent Lyα
profiles to coincide with the metal-line redshift.

3.2. Lya Fits 

Those damped Lyα candidates which were not rejected by visual
inspection were fitted with Voigt profiles in a semi-quantitative
fashion. As discussed by Prochaska et al. (2003a), systematic error
associated with continuum placement and line-blending generally
dominates the statistical error associated with Poisson noise in the
quasar flux. We have fitted the Lyα profile with an IDL tool x_fitdla
which allows the user to interactively modify the Voigt profiles and
continuum placements. We then estimated a conservative uncertainty to
each N_HI value based on the constraints of the quasar continuum near
the Lyα profile, the degree of line blending, and the Poisson noise of
the data. Bolton et al. (2004) have emphasized that the SDSS
spectroscopic data has non-Gaussian fluctuations, e.g. 3σ departures
from the 'true' value with higher than 0.27% frequency. Therefore, we
have generally ignored single pixel outliers when performing the Lyα
fits.

2 We note that these damped Lyα candidates may very well
exhibit metal-line absorption at higher spectral resolution and/or
within the Lyα forest.

To be conservative, we adopt a minimum uncertainty of 0.15 dex in
log N_HI for the absorption systems with unambiguous metal-line
detections and 0.20 dex otherwise. These uncertainties are larger than
those reported in PH04. As noted above, the effects of line blending
and continuum uncertainty dominate the measurement uncertainty of the
N_HI values. Furthermore, it is very unlikely that the errors related
to these systematic effects are Gaussian distributed. At present, it
is very difficult to accurately assess the magnitude and distribution
of the errors in the N_HI values. We discuss below an attempt based on
the analysis of mock spectra, yet even this approach has limited
value. Nevertheless, we contend that 95% of the true N_HI values will
lie within 2σ of the reported N_HI values, but we cannot argue that
the reported errors are Gaussian distributed. This remains a
significant failing in the analysis of DLA surveys.

Figure 3 shows four example profile fits from the SDSS sample which
illustrate several of the key issues. In Figure 3a we present a high
signal-to-noise case with a well constrained quasar continuum and the
redshift centroided by metal-line detections. The fit is constrained
by data both in the core and wings of the Lyα profile and the adopted
0.15 dex uncertainty is overly conservative. Figure 3b presents a low
signal-to-noise case. In this case, it is very difficult to constrain
the fit based on the core of the Lyα profile. Nevertheless, the
continuum level is reasonably well constrained and the wings of the
profile set the N_HI value. Figure 3c presents a case where
line-blending is severe and the N_HI value is only constrained by the
wings of the profile. In these cases the continuum placement is
critical. Finally, Figure 3d emphasizes the importance of metal-line
detection. In this example, the Lyα profile is centered on the
redshift of the metal-line transitions. Without this constraint, we
would have centered the Lyα profile a few Angstroms redward of the
current position and derived a larger N_HI value. We suspect that some
damped Lyα systems without metal-line detections have N_HI values that
are biased high (≈ 0.1 to 0.2 dex). Of course, these damped Lyα
systems tend to have lower N_HI values and this systematic effect is
primarily an issue only for candidates with N_HI ~ 2 × 10^20 cm^-2.
Finally, we should stress that the N_HI values of damped Lyα
candidates near the quasar O VI or Lyα emission features are difficult
to constrain because of uncertain continuum placement.

Fig. 3. Example profile fits to four damped Lyα systems from
our SDSS-DR3 survey. The examples represent (a) a high SNR
case with a well constrained quasar continuum and minimal line
blending (σ(N_HI) = 0.15); (b) a low SNR case (σ(N_HI) = 0.30); (c)
an example with severe line blending (σ(N_HI) = 0.20); and (d) an
example where determining the redshift from associated metal-line
absorption is very important (σ(N_HI) = 0.15).

All of the fits from the DR3 sample and the Lya fits 
from are presented at 

http://www.ucolick.org/~xavier/SDSSDLA/index.html.
In addition to postscript plots, we provide the files (IDL 
format) which contain the Lya fits and quasar continua, 
and also a description of the software used to perform the 
fits. Interested readers can reanalyze any of our Lya fits. 



Table 4

SDSS-DR3 DLA NON-STATISTICAL SAMPLE

Name                z_abs   DR  f_m^a  log N_HI
J001115.2+144601.8  3.4523  2   2      21.65 ± ...
J001134.5+155137.4  4.3175  2   1      20.50 ± ...
J001134.5+155137.4  4.3592  2   2      21.10 ± ...
J004950.9-093035.6  3.2858  2   2      20.70 ± ...
J014214.7+002324.3  3.3482  1   2      20.40 ± ...
J020651.4-094141.3  2.4702  2   2      20.30 ± ...
J021232.1-100422.1  2.7140  1   ...    21.00 ± ...
J023148.8-073906.3  2.2982  3   2      ...
J033344.4-060625.1  3.9349  3   2      21.60 ± ...
J073149.5+285448.7  2.6859  2   1      20.55 ± ...

a 0=No metals; 1=Weak metals; 2=Metals detected

Note: [The complete version of this table is in the electronic
edition of the Journal. The printed edition contains only a sample.]



Table 5

SDSS-DR3 SUPER-LLS SAMPLE

Quasar              z_abs   DR  SNR  CLR  f_m^a  log N_HI
J001328.2+135828.0  2.6123  3   5    n    1      20.10 ± ...
J002614.6+143105.2  3.9039  2   5    n    2      20.10 ± ...
J004732.7+002111.3  2.4687  2   5    n    2      20.00 ± ...
J013317.7+144300.3  2.4754  2   4    n    2      20.00 ± ...
J013317.7+144300.3  2.9766  2   8    n    2      20.15 ± ...
J014609.3-092918.2  3.6804  2   8    n    1      20.25 ± ...
J021143.3-084723.8  2.2684  2   4    y    2      20.10 ± ...
J021232.1-100422.1  2.2738  1   8    n    2      20.25 ± ...
J025039.1-065405.1  4.3894  3   4    n    2      20.00 ± ...
J031036.8+005521.7  3.1142  2   5    n    2      20.20 ± ...

a 0=No metals; 1=Weak metals; 2=Metals detected

Note: [The complete version of this table is in the electronic
edition of the Journal. The printed edition contains only a sample.]

Table 3

SDSS-DR3 DLA STATISTICAL SAMPLE

Quasar              z_abs   DR  SNR  CLR  f_m^a  log N_HI
J000238.4-101149.8  3.2674  2   5    n    2      21.20 ± ...
J000536.3+135949.4  3.4896  2   5    n    2      20.30 ± ...
J001115.2+144601.8  3.6118  2   8    n    1      20.60 ± ...
J001240.5+135236.7  3.0217  2   8    n    2      20.55 ± ...
J001328.2+135827.9  3.2808  2   8    n    2      21.55 ± ...
J001918.4+150611.3  3.9710  2   5    n    2      20.40 ± ...
J002614.6+143105.2  3.3882  2   5    n    2      20.70 ± ...
J003126.7+150739.5  3.3583  2   8    n    2      20.95 ± ...
J003501.8-091817.6  2.3376  1   8    n    1      20.55 ± ...
J003749.1+155208.3  3.4790  2   4    n    1      20.35 ± ...

a 0=No metals; 1=Weak metals; 2=Metals detected

Note: [The complete version of this table is in the electronic
edition of the Journal. The printed edition contains only a sample.]

Tables 3 to 5 present the N_HI values for the damped Lyα systems
comprising the statistical sample, the damped Lyα systems discovered
which are not in any of our statistical samples, and all of the damped
Lyα candidates with central N_HI values less than 2 × 10^20 cm^-2
(termed super-LLS). Regarding Table 3, we list the damped Lyα
absorption redshift, indicate its membership within the various
statistical cuts, list the data release of the actual spectrum
analyzed (see footnote 3), present the metal-line characteristics, and
list the N_HI value and error. The non-statistical damped Lyα sample
is comprised of damped Lyα systems with z_abs ~ z_qso, z_abs < z_i,
and/or systems toward quasars with strong intrinsic absorption. We
present this Table for completeness. We emphasize that while the
absorbers listed in Table 5 have a central value below
2 × 10^20 cm^-2, the true N_HI value of many of these absorbers will
exceed the damped Lyα threshold.


3.3. Completeness Tests and N_HI Accuracy

There are several tests one can perform to assess the reliability and
completeness of the N_HI analysis. A valuable test of the N_HI
accuracy is to compare the SDSS measurements against values derived
from observations at higher spectral resolution and/or SNR. Over the
past several years, we have observed a subset of the SDSS damped Lyα
sample for other scientific studies (e.g. chemical evolution and
C II* analysis; Prochaska et al. 2003b; Wolfe et al. 2004) with the
Echellette Spectrometer and Imager (ESI; Sheinis et al. 2002).
Figure 4 compares the N_HI values of the SDSS fits with those derived
from the ESI spectra. The agreement is excellent. It is important to
note, however, that the fits were performed primarily by one of us
(JXP) and that the systematic error related to continuum placement is
likely correlated. Similarly, the effects of line-blending are
qualitatively similar for the two data sets. Indeed, the reduced
χ² = 0.4 indicates the estimated uncertainties significantly exceed
the statistical uncertainty. There is a notable case from PH04 (at
z = 2.77 toward J084407.29+515311) where the SDSS value is 0.3 dex
higher than the ESI value due to improper continuum placement in the
SDSS analysis. We believe this one case was the result of our initial
inexperience with the SDSS spectra and that such errors are very rare
in the current analysis. Nevertheless, it does underscore the fact
that the errors in the N_HI values are predominantly systematic and in
some cases large.

3 In general this corresponds to the first data release when the
quasar was observed by SDSS although errors in our bookkeeping
may have delayed its analysis.

Fig. 4. Comparison of the N_HI values for a sub-set of the
SDSS-DR3 sample as measured from the SDSS quasar spectra and,
independently, data acquired with the Echellette Spectrometer and
Imager.

Fig. 5. Histogram of the N_HI values for every damped Lyα
candidate fit in the SDSS-DR3 sample. Note that the distribution
peaks at N_HI < 2 × 10^20 cm^-2 suggesting a high level of completeness
at the damped Lyα threshold (dotted line). We currently estimate
the completeness level to be ~ 99%.

One qualitative assessment of sample completeness is an inspection of
the N_HI distribution. Figure 5 presents a histogram of N_HI values
for all of the fitted damped Lyα candidates. The distribution shows a
steady increase to lower N_HI value which extends just below
2 × 10^20 cm^-2. We note that the relatively steep decline in the
distribution at N_HI < 10^20.2 cm^-2 is partly due to incompleteness
in our search algorithm. It is also due to the fact that we eliminated
many damped Lyα candidates after visual inspection.

Since the publication of PH04, we have performed a damped Lyα survey
on a new set of mock spectra kindly provided by P. McDonald. These
mock spectra were carefully constructed to match the SNR and redshifts
of the full SDSS-DR3 quasar sample. A synthetic Lyα forest was added
to the quasar continuum according to the prescriptions detailed in
McDonald et al. (2005a). Finally, a random set of damped Lyα systems
(with a column density frequency distribution similar to the results
below) were added to the spectra and also a set of Lyman limit systems
with N_HI = 1 × 10^17 to 2 × 10^20 cm^-2 following the prescriptions
of McDonald et al. (2005b). We analyzed these mock spectra using the
same algorithms as the scientific search and recovered 99% of the
damped Lyα systems. The few that we did not 'discover' have
N_HI < 10^20.4 cm^-2 and just missed satisfying the damped Lyα
candidate criteria. Furthermore, we recovered 90% of the absorbers
with N_HI ~ 1 × 10^20 cm^-2. Therefore, we are confident that the
sample is nearly complete although it is likely we are missing ≈ 5% of
the absorbers with N_HI ≈ 2 × 10^20 cm^-2.

To further investigate uncertainty in our N_HI measurements, we fitted
Voigt profiles to 50 of the damped Lyα systems from the mock spectra.
With only one exception, our fitted values are within 0.3 dex of the
true value. In this one exception we fitted one damped Lyα profile to
a case that was actually two damped systems separated by
≈ 500 km s^-1. In practice, this situation can be accounted for if
both damped Lyα systems exhibit metal-line absorption. In any case, we
expect these systems to be rare (Lopez & Ellison 2003), but a non-zero
systematic error to the damped Lyα analysis. In future papers, we will
expand this type of mock analysis to probe other aspects of fitting
errors and completeness limits for the SDSS samples.

4. RESULTS 

In this section, we report the main results of the damped Lyα survey.
These include the H I frequency distribution function and its zeroth
and first moments. It is amusing to note that many of the techniques
are directly analogous to the derivation and analysis of galaxy
luminosity functions. Indeed, we rely on many of the standard
techniques and face similar challenges and systematic uncertainties.

Fig. 6. The H I frequency distribution f_HI(N, X) for all of the damped Lyα systems identified in the SDSS-DR3_4 sample (mean redshift
z = 3.06). Overplotted on the discrete evaluation of f_HI(N, X) are the fits of a single power-law, a Γ-function, and a double power-law. Only
the latter two are acceptable fits to the observations.

4.1. f_HI(N, X): The H I Column Density Frequency Distribution

Following previous work (e.g. Lanzetta et al. 1991), we define the number of damped
Lyα systems in the intervals $(N, N + dN)$ and $(X, X + dX)$:

$$ f_{\rm HI}(N,X)\,dN\,dX \tag{3} $$

where $f_{\rm HI}(N,X)$ is the frequency distribution and the 'absorption distance'

$$ dX \equiv \frac{H_0}{H(z)}\,(1+z)^2\,dz . \tag{4} $$

With this definition, an unevolving population of objects will have constant
$f_{\rm HI}(N,X)$ in time provided one adopts the correct cosmology.
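
As an illustration of Equation 4, the following minimal sketch integrates the absorption distance numerically. It assumes a Friedmann cosmology with matter, curvature, and a cosmological constant; the default values $\Omega_m = 0.3$ and $\Omega_\Lambda = 0.7$ and the function name `absorption_distance` are assumptions made for this example and are not taken from the text.

```python
import math


def absorption_distance(z1, z2, omega_m=0.3, omega_lambda=0.7, steps=10000):
    """Integrate dX = [H0 / H(z)] (1 + z)^2 dz from z1 to z2.

    The cosmological parameters are illustrative defaults (an assumption).
    `steps` must be even because composite Simpson's rule is used.
    """
    omega_k = 1.0 - omega_m - omega_lambda

    def integrand(z):
        # E(z) = H(z) / H0 for matter + curvature + cosmological constant
        e_z = math.sqrt(omega_m * (1.0 + z) ** 3
                        + omega_k * (1.0 + z) ** 2
                        + omega_lambda)
        return (1.0 + z) ** 2 / e_z

    h = (z2 - z1) / steps
    total = integrand(z1) + integrand(z2)
    for i in range(1, steps):
        weight = 4.0 if i % 2 else 2.0
        total += weight * integrand(z1 + i * h)
    return total * h / 3.0


if __name__ == "__main__":
    # Absorption pathlength covered by one sightline between z = 2.2 and 2.5
    print(absorption_distance(2.2, 2.5))
```

For an Einstein-de Sitter universe ($\Omega_m = 1$, $\Omega_\Lambda = 0$) the integrand reduces to $(1+z)^{1/2}$, which gives a simple analytic check of the routine.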

In Figure 6, we present $f_{\rm HI}(N,X)$ for the full SDSS-DR3_4 survey. This sample
spans the redshift interval $z = [2.2, 5.5]$ with an integrated absorption pathlength
$\Delta X = 7417.5$ and an $N_{\rm HI}$-weighted mean redshift of 3.2. The solid
points with error bars describe $f_{\rm HI}(N,X)$ for $N_{\rm HI}$ bins of
$\Delta N = 0.1$ dex:

$$ f_{\rm HI}(N,X) = \frac{m_{\rm DLA}(N, N+\Delta N)}{\Delta X} \tag{5} $$

with $m_{\rm DLA}$ the number of damped Lyα systems within $(N, N + \Delta N)$ in the
$\Delta X$ interval. The error bars reflect Poisson uncertainty (68.3% c.l.)
according to the value of $m_{\rm DLA}$ and the upper limits correspond to 95% c.l.
The derivation of $f_{\rm HI}(N,X)$ in this discrete manner is analogous to the
derivation of galaxy luminosity functions using the $V_{\rm max}$ method.
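
The discrete evaluation of Equation 5 can be sketched as follows. This is only an illustration: the function name `binned_counts`, the input values in the usage example, and the use of $\sqrt{m_{\rm DLA}}$ as the error bar (a Gaussian approximation to the Poisson confidence limits quoted above) are assumptions of this sketch.

```python
import math


def binned_counts(log_n_values, delta_x, log_n_min=20.3, bin_width=0.1, n_bins=20):
    """Bin log10(N_HI) values and evaluate m_DLA / delta_x per bin (Equation 5).

    Returns a list of tuples
    (lower bin edge, m_DLA, m_DLA / delta_x, sqrt(m_DLA) / delta_x).
    The sqrt(m) error is an approximation, not the exact Poisson interval.
    """
    counts = [0] * n_bins
    for log_n in log_n_values:
        # Small epsilon guards against floating-point values just below an edge
        idx = int(math.floor((log_n - log_n_min) / bin_width + 1e-9))
        if 0 <= idx < n_bins:
            counts[idx] += 1
    rows = []
    for i, m in enumerate(counts):
        edge = log_n_min + i * bin_width
        rows.append((edge, m, m / delta_x, math.sqrt(m) / delta_x))
    return rows


if __name__ == "__main__":
    # Hypothetical column densities and pathlength, for illustration only
    for row in binned_counts([20.35, 20.38, 20.45, 21.05], delta_x=10.0):
        if row[1] > 0:
            print(row)
```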

Lacking a physical model, we have considered three functional forms to describe
$f_{\rm HI}(N,X)$: (i) a single power-law

$$ f_{\rm HI}(N,X) = k_1 N^{\alpha_1} , \tag{6} $$

(ii) a Γ-function (e.g. Fall & Pei 1993)

$$ f_{\rm HI}(N,X) = k_2 \left(\frac{N}{N_\gamma}\right)^{\alpha_2} \exp\left(\frac{-N}{N_\gamma}\right) , \tag{7} $$

and (iii) a double power-law

$$ f_{\rm HI}(N,X) = k_3 \left(\frac{N}{N_d}\right)^{\beta} \quad {\rm where} \quad \beta = \begin{cases} \alpha_3 : N < N_d \\ \alpha_4 : N \geq N_d \end{cases} \tag{8} $$

In each case, we perform a maximum likelihood analysis to constrain the functional
parameters and set the normalization constants ($k_1, k_2, k_3$) by imposing the
integral constraint

$$ \int_{N_{\rm th}}^{\infty} f_{\rm HI}(N,X)\,dN = \frac{m_{\rm DLA}}{\Delta X} . \tag{9} $$

To estimate the parameter uncertainties and construct a correlation matrix, we
performed a 'jack-knife' analysis (e.g. Lupton 1993; Blanton et al. 2003).
Specifically, we derived the best fit parameters for 21 subsamples ignoring 222
random quasars in each case. The parameter uncertainty is then given by

$$ \sigma^2 = \frac{N-1}{N} \sum_{i=1}^{N} (x_i - \bar{x})^2 \tag{10} $$

with $N = 21$. In some cases these values are close to the one-parameter confidence
limits derived from the maximum likelihood function although there are notable
exceptions. We calculate the correlation matrix in standard fashion (see Blanton et
al. 2003),

$$ r_{ij} = \frac{\langle \Delta x_i \Delta x_j \rangle}{\left(\langle \Delta x_i^2 \rangle \langle \Delta x_j^2 \rangle\right)^{1/2}} \tag{11} $$

with

$$ \langle \Delta x_i \Delta x_j \rangle = \frac{N-1}{N} \sum^{N} (x_i - \bar{x}_i)(x_j - \bar{x}_j) . \tag{12} $$
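
The jack-knife estimates of Equations 10 to 12 can be written compactly in code. The sketch below assumes that each subsample has already been fitted and is represented by a tuple of best-fit parameters; the function name `jackknife_stats` and the toy inputs in the usage example are hypothetical.

```python
import math


def jackknife_stats(samples):
    """Jack-knife uncertainties and correlation matrix.

    `samples` is a list of N parameter tuples, one per jack-knife subsample.
    Returns (sigma, corr) where sigma[i] is the jack-knife uncertainty of
    parameter i and corr[i][j] the correlation coefficient between
    parameters i and j. Assumes every parameter varies across subsamples
    (otherwise sigma is zero and the correlation is undefined).
    """
    n = len(samples)
    n_par = len(samples[0])
    means = [sum(s[i] for s in samples) / n for i in range(n_par)]
    factor = (n - 1) / n  # jack-knife prefactor (N - 1) / N
    cov = [[factor * sum((s[i] - means[i]) * (s[j] - means[j]) for s in samples)
            for j in range(n_par)]
           for i in range(n_par)]
    sigma = [math.sqrt(cov[i][i]) for i in range(n_par)]
    corr = [[cov[i][j] / (sigma[i] * sigma[j]) for j in range(n_par)]
            for i in range(n_par)]
    return sigma, corr


if __name__ == "__main__":
    # Toy (log k, alpha) pairs standing in for fits to three subsamples
    fits = [(-23.2, -2.15), (-23.1, -2.20), (-23.3, -2.18)]
    sigma, corr = jackknife_stats(fits)
    print(sigma)
    print(corr)
```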



As is typically the case for fits to galaxy luminosity functions, we find significant
correlation between the parameters. It is important to keep this in mind when
comparing the fits to various redshift intervals. Table 6 presents the best-fit
values for the parameters for the full SDSS-DR3 sample and also the fits to the
damped Lyα systems in several redshift intervals. In this table, the error bars refer
to 'one-parameter uncertainties' that correspond to the 68% c.l. of the maximum
likelihood function when keeping the other parameters fixed at their best-fit values.
The exceptions are the normalization values where we have only reported the
Poissonian error based on the best fit. Table 7 provides the parameter uncertainties
and the correlation matrix for the full SDSS-DR3_4 sample. When the absolute value of
an off-diagonal term is much less than 1, there is little correlation between the
corresponding parameters. Unfortunately, there are too few damped Lyα systems in the
redshift bins to perform a meaningful jack-knife analysis for these subsets. We
suspect, however, that the parameters are correlated in a manner similar to the
results presented in Table 7.

The best-fit solutions are overplotted on the binned evaluation of $f_{\rm HI}(N,X)$
in Figure 6. First, consider the single power-law solution (dotted line) with a
best-fit slope of $\alpha_1 = -2.19 \pm 0.05$. This functional form is a poor
description of the data. A one-sided Kolmogorov-Smirnov (KS) test indicates there is
a less than 0.1% probability that the cumulative distributions of the observations
and the power-law are drawn from the same parent population. In short, the power-law
is too steep at low $N_{\rm HI}$ and too shallow at large $N_{\rm HI}$. Although
previous surveys suggested a single power-law was a poor description (e.g. Wolfe et
al. 1995), their sample size was too small to rule out this solution. Note that this
result contrasts the damped Lyα systems with the Lyα forest (absorbers with
$N_{\rm HI} < 10^{15}\,{\rm cm^{-2}}$) where a single power-law with exponent
$\alpha_1 \approx -1.5$ is a good description of the observations (e.g. Kirkman &
Tytler 1997).

Although a single power-law is a poor description of the observations, the fit does
highlight an important new result: the $f_{\rm HI}(N,X)$ distribution is steeper than
a $N^{-2}$ power-law at large column density. We will further develop this point in
§ 4.3. The other two curves on Figure 6 show the Γ-function (dashed line) and double
power-law fit (dash-dot line). Both of these functional forms are a fair

Table 6
FITS TO f_HI(N,X)

Form     Parameters
Single   log k_1, α_1
Gamma    log k_2, log N_γ, α_2
Double   log k_3, log N_d, α_3, α_4

Columns: z ∈ [2.2, 5.5)^a, z ∈ [1.7, 2.2), z ∈ [2.2, 2.5), z ∈ [2.5, 3.0), z ∈ [3.0, 3.5), z ∈ [3.5, 5.5)

[The numerical entries of this table are not recoverable from the extracted text.]

^a Restricted to the SDSS-DR3_4 sample. The remaining columns include the full sample.

Note. - The errors reported are one-parameter errors which do not account for
correlations among the parameters. See Table 7 for the correlation matrix.



Table 7
CORRELATION MATRIX FOR f_HI(N,X)

Param        σ      δ log k_1   δα_1    δ log k_2   δ log N_γ   δα_2    δ log k_3   δ log N_d   δα_3    δα_4
δ log k_1    1.04     1.00     -1.00     -0.48        0.28     -0.80     -0.33       -0.01     -0.93   -0.06
δα_1         0.05    -1.00      1.00      0.48       -0.29      0.80      0.32        0.02      0.93    0.05
δ log k_2    0.29    -0.48      0.48      1.00       -0.98      0.91      0.47       -0.25      0.66   -0.28
δ log N_γ    0.10     0.28     -0.29     -0.98        1.00     -0.80     -0.45        0.29     -0.50    0.29
δα_2         0.11    -0.80      0.80      0.91       -0.80      1.00      0.48       -0.17      0.89   -0.15
δ log k_3    0.18    -0.33      0.32      0.47       -0.45      0.48      1.00       -0.93      0.61    0.53
δ log N_d    0.08    -0.01      0.02     -0.25        0.29     -0.17     -0.93        1.00     -0.29   -0.64
δα_3         0.07    -0.93      0.93      0.66       -0.50      0.89      0.61       -0.29      1.00    0.09
δα_4         2.13    -0.06      0.05     -0.28        0.29     -0.15      0.53       -0.64      0.09    1.00

Note. - Restricted to the SDSS-DR3_4 sample.

fit to the observations; a one-sided KS test gives values > 10% for the full
$1\sigma$ range of the parameters. Furthermore, there is good agreement between the
'break' column densities ($N_\gamma$ and $N_d$) and the power-law indices at low
column densities are consistent. Both solutions indicate that $f_{\rm HI}(N,X)$ drops
very steeply ($\alpha \ll -2$) at $N_{\rm HI} \approx 10^{21.5}\,{\rm cm^{-2}}$ and
that the distribution has a 'faint-end' slope of $\alpha \approx -2$.



Fig. 7. - Plot of the $N_{\rm HI}$ frequency distribution of damped Lyα systems in 5
redshift intervals against the full distribution. The four $N_{\rm HI}$ intervals
(one per panel) are: (1) $20.3 < \log N_{\rm HI} < 20.6$; (2)
$20.6 < \log N_{\rm HI} < 21.0$; (3) $21.0 < \log N_{\rm HI} < 21.4$; and (4)
$21.4 < \log N_{\rm HI} < 21.8$. One observes significant evolution in
$f_{\rm HI}(N,X)$ at low $N_{\rm HI}$ values, but the shape of $f_{\rm HI}(N,X)$ is
nearly invariant with redshift. Finally, note that the data points for z < 2.2
(marked with a cross) do not include measurements from the SDSS survey.



In the following sub-sections, we will consider evolution in the zeroth and first
moments of $f_{\rm HI}(N,X)$. Figure 7 qualitatively describes the redshift evolution
of the full $f_{\rm HI}(N,X)$ distribution. The figure plots $f_{\rm HI}(N,X)$ as a
function of redshift against the $f_{\rm HI}(N,X)$ distribution of the full SDSS-DR3
sample in four $N_{\rm HI}$ intervals: (1) $20.3 < \log N_{\rm HI} < 20.6$; (2)
$20.6 < \log N_{\rm HI} < 21.0$; (3) $21.0 < \log N_{\rm HI} < 21.4$; and (4)
$21.4 < \log N_{\rm HI} < 21.8$. At low $N_{\rm HI}$, $f_{\rm HI}(N,X)$ increases
monotonically with redshift. The peak-to-peak range is modest, however, evolving by
only a factor of $\approx 2$. One identifies similar evolution in the high
$N_{\rm HI}$ bins with the exception of the highest redshift interval.

Perhaps the most important result is that the shape of $f_{\rm HI}(N,X)$ is
independent of redshift. We have compared the $f_{\rm HI}(N,X)$ distributions from
each redshift bin using a two-sided Kolmogorov-Smirnov (KS) test. This test compares
the shape of the distributions but is insensitive to the normalization. Our analysis
found KS probabilities greater than 10% for every pair of redshift intervals.
Contrary to previous claims, therefore, there is no evidence for significant redshift
evolution in the shape of $f_{\rm HI}(N,X)$ for the DLAs (e.g. a steepening beyond
z = 3). With the current sample, there may only be significant evolution in the
normalization of $f_{\rm HI}(N,X)$.

4.2. $\ell_{\rm DLA}(X)\,dX$: The Incidence of Damped Lyα Systems

The zeroth moment of $f_{\rm HI}(N,X)$ gives the number of damped Lyα systems
encountered per unit absorption pathlength $dX$, i.e. the line-density of damped Lyα
systems$^4$:

$$ \ell_{\rm DLA}(X)\,dX = \int_{N_{\rm th}}^{\infty} f_{\rm HI}(N,X)\,dN\,dX . \tag{13} $$

The line density is related to the comoving number density of damped Lyα systems
$n_{\rm DLA}(X)$ and the cross-section $A(X)$ as follows:

$$ \ell_{\rm DLA}(X) = (c/H_0)\, n_{\rm DLA}(X)\, A(X) . \tag{14} $$

In this manner, $\ell_{\rm DLA}$ is related to the covering fraction of damped Lyα
systems on the sky.

$^4$ Previous work has represented this quantity with various terminology that is
often confused with the column density or number density (e.g. dN/dX, dn/dz).

Figure 8 shows $\ell_{\rm DLA}(X)\,dX$ for the damped Lyα systems at z > 1.6 and for
the local universe (Zwaan, Briggs, & Sprayberry 2001; Rosenberg & Schneider 2003;
Ryan-Weber, Webster, & Staveley-Smith 2003, 2005). The values for the damped Lyα
systems were calculated in the discrete limit, e.g.

$$ \ell_{\rm DLA}(X) = \frac{m_{\rm DLA}}{\Delta X} \tag{15} $$

and the error bars reflect Poisson uncertainty in $m_{\rm DLA}$ (68.3% c.l.). The
solid line traces the value of $\ell_{\rm DLA}(X)$ derived in a series of 0.5 Gyr
time intervals to reveal any differences in binning.

Fig. 8. - Plot of the line density of damped Lyα systems $\ell_{\rm DLA}(X)$ versus
redshift. The points at z = 0 are from three analyses of 21cm observations by Zwaan,
Briggs, & Sprayberry (2001); Rosenberg & Schneider (2003); Ryan-Weber, Webster, &
Staveley-Smith (2003, 2005). The curve overplotted on the data traces the evolution
of $\ell_{\rm DLA}(X)$ in a series of 0.5 Gyr intervals. Contrary to previous studies
(which focused on the line density in redshift space $\ell_{\rm DLA}(z)$), we find
statistically significant evolution in the line density per unit absorption distance.

It is evident from the figure that the line density of damped Lyα systems increases
from z = 2 to 4; the increase between the z = [2.2, 2.5] interval and the
z = [3.5, 5.5] bin is a factor of 1.7 ± 0.2. The largest change in
$\ell_{\rm DLA}(X)$ occurs during a $\Delta z = 1$ interval from $z \approx 2.3$ to
3.3 corresponding to $\Delta t = 800$ Myr. We discuss possible explanations for this
evolution in § 6. Another interesting result is that $\ell_{\rm DLA}(z \approx 2.3)$
is less than $2\sigma$ larger than $\ell_{\rm DLA}(z = 0)$, i.e. the data suggest
little evolution in the line density of damped systems over the past $\approx 10$
Gyr. Within the context of hierarchical cosmology, this is a surprising result. It
implies that when smaller galaxies merge and accrete other systems, the product of
the comoving density and the total gas cross-section satisfying the damped Lyα
criterion is conserved. Perhaps this is related to the fact that the damped Lyα
threshold is near the surface density threshold for star formation.
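
The discrete line density of Equation 15, and a ratio between two redshift bins such as the one quoted above, can be illustrated with a short sketch. The counts and pathlengths in the usage example are placeholders rather than survey values, and the $\sqrt{m_{\rm DLA}}$ error bar with first-order error propagation is an assumption that approximates the Poisson intervals used in the text.

```python
import math


def line_density(m_dla, delta_x):
    """Return (l_DLA, error) with l_DLA = m_DLA / delta_x (Equation 15).

    The error is sqrt(m_DLA) / delta_x, a Gaussian approximation to the
    Poisson uncertainty.
    """
    return m_dla / delta_x, math.sqrt(m_dla) / delta_x


def ratio_with_error(a, sig_a, b, sig_b):
    """Ratio a / b with first-order propagated uncertainty."""
    r = a / b
    return r, r * math.sqrt((sig_a / a) ** 2 + (sig_b / b) ** 2)


if __name__ == "__main__":
    # Hypothetical counts and pathlengths for a low and a high redshift bin
    low = line_density(60, 1200.0)
    high = line_density(90, 1000.0)
    print(ratio_with_error(high[0], high[1], low[0], low[1]))
```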

4.3. $\Omega_g$: The Cosmological Neutral Gas Mass Density

4.3.1. Definition of $\Omega_g$

The first moment of the $f_{\rm HI}(N,X)$ distribution gives a cosmologically
meaningful quantity, the gas mass density of H I atoms:

$$ \Omega_g^{\rm HI}(X)\,dX = \frac{\mu m_H H_0}{c \rho_c} \int_{N_{\rm min}}^{N_{\rm max}} N_{\rm HI}\, f_{\rm HI}(N,X)\,dN\,dX \tag{16} $$

where $\mu$ is the mean molecular mass of the gas (taken to be 1.3), $H_0$ is
Hubble's constant, and $\rho_c$ is the critical mass density. Traditional treatments
of $\Omega_g^{\rm HI}$ and the damped Lyα systems have integrated Equation 16 from
$N_{\rm min} = 2 \times 10^{20}\,{\rm cm^{-2}}$ to $N_{\rm max} = \infty$, yielding
$\Omega_g^{\rm DLA}$. As emphasized by Peroux et al. (2003), $\Omega_g^{\rm DLA}$ may
be significantly less than $\Omega_g^{\rm HI}$ if absorbers below the damped Lyα
threshold contribute to the H I mass density. In this case, $\Omega_g^{\rm HI}$ would
include a large contribution from gas which is predominantly ionized because the H I
atoms in the Lyman limit systems are a mere tracer of the gas. It is difficult,
however, to assign any physical significance to $\Omega_g^{\rm HI}$ aside from a mere
census of the H I atoms in the universe. In contrast, $\Omega_g^{\rm DLA}$ offers a
good estimate of the mass density of gas which is predominantly neutral (see below).

In the following, we will consider both quantities with primary emphasis on
$\Omega_g^{\rm DLA}$ because we contend it best defines the gas reservoir available
for star formation at high redshift. In practice, one generally evaluates
$\Omega_g^{\rm DLA}$ in the discrete limit

$$ \Omega_g^{\rm DLA} = \frac{\mu m_H H_0}{c \rho_c}\, \frac{\sum N_{\rm HI}}{\Delta X} \tag{17} $$

where the sum is performed over the $N_{\rm HI}$ measurements of the damped Lyα
systems in a given redshift interval with total pathlength $\Delta X$.
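
A minimal numerical sketch of Equation 17 follows. The physical constants are standard cgs values, but the Hubble constant default ($70\,{\rm km\,s^{-1}\,Mpc^{-1}}$), the function name `omega_dla`, and the inputs in the usage example are assumptions made for illustration and are not taken from the survey.

```python
import math

MU = 1.3            # mean molecular mass adopted in the text
M_H = 1.6735e-24    # mass of the hydrogen atom [g]
C = 2.9979e10       # speed of light [cm/s]
G = 6.674e-8        # gravitational constant [cm^3 g^-1 s^-2]
MPC_CM = 3.0857e24  # one megaparsec [cm]


def omega_dla(log_n_values, delta_x, h0_km_s_mpc=70.0):
    """Evaluate Equation 17 for a list of log10(N_HI / cm^-2) values.

    `h0_km_s_mpc` is an assumed Hubble constant; the result scales as 1/H0.
    """
    h0 = h0_km_s_mpc * 1.0e5 / MPC_CM              # H0 in s^-1
    rho_c = 3.0 * h0 ** 2 / (8.0 * math.pi * G)    # critical density [g cm^-3]
    sum_n = sum(10.0 ** x for x in log_n_values)   # total column density [cm^-2]
    return MU * M_H * h0 / (C * rho_c) * sum_n / delta_x


if __name__ == "__main__":
    # Three hypothetical absorbers over a hypothetical pathlength of 40
    print(omega_dla([20.4, 20.9, 21.3], delta_x=40.0))
```

Because $\rho_c \propto H_0^2$, the prefactor scales as $1/H_0$, so the adopted Hubble constant enters the result directly.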

Below, we consider several definitions for $\Omega_g$ based on the values of
$N_{\rm min}$ and $N_{\rm max}$. Table 8 summarizes the various definitions.

Table 8
DEFINITIONS OF Ω_g

Def.        N_min (cm^-2)   N_max (cm^-2)   Description
Ω_g^HI      0               ∞               Density of gas associated with H I atoms
Ω_g^Neut    ??              ∞               Density of predominantly neutral gas
Ω_g^DLA     2 x 10^20       ∞               Density of gas associated with the DLA
Ω_g^LLS     1.6 x 10^17     2 x 10^20       Density of the gas associated with LLS
Ω_g^21cm                                    Density of H I gas in collapsed objects

4.3.2. Convergence of $\Omega_g$

Consider first the upper limit, $N_{\rm max}$. To verify $\Omega_g$ converges, it is
necessary to integrate $f_{\rm HI}(N,X)$ until

$$ \frac{d \log f_{\rm HI}(N,X)}{d \log N_{\rm HI}} \ll -2 , \tag{18} $$

i.e. until one establishes that $f_{\rm HI}(N,X)$ is significantly steeper than a
$N_{\rm HI}^{-2}$ power-law. As noted in § 4.1 and Figure 6, the Γ-function and
double power-law fits to the $f_{\rm HI}(N,X)$ distribution indicate
$f_{\rm HI}(N,X)$ steepens at $N_{\rm HI} \approx 10^{21.5}\,{\rm cm^{-2}}$.
Furthermore, even a single power-law fit to the data almost satisfies Equation 18.
This point is emphasized in Figure 9 which presents the exponent of the best fitting
single power-law to the SDSS-DR3 $f_{\rm HI}(N,X)$ distribution function as a
function of the lower $N_{\rm HI}$ bound to the distribution function, e.g., the
point at $\log N_{\rm HI} = 21$ shows the exponent for the single power-law fit to
$f_{\rm HI}(N_{\rm HI} > 10^{21}\,{\rm cm^{-2}}; X)$. Not surprisingly, the curve
starts at $N_{\rm HI} = 2 \times 10^{20}\,{\rm cm^{-2}}$ with $\alpha_1 = -2.2$ and
decreases with increasing $N_{\rm HI}$. In addition, it is important that the value
of $\alpha_1$ decreases below $-3$ in Figure 9 because a power-law with
$\alpha_1 = -2.2$ converges very slowly with $N_{\rm max}$.
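
The slow convergence of a shallow power-law can be made concrete. For $f \propto N^{\alpha}$ with $\alpha < -2$, the first moment integrated from $N_{\rm min}$ to $N_{\rm max}$, relative to its value for $N_{\rm max} \to \infty$, is $1 - (N_{\rm max}/N_{\rm min})^{\alpha + 2}$. The sketch below evaluates this fraction; the function name is hypothetical and the pure power-law form is an idealization of the fits discussed above.

```python
def first_moment_fraction(alpha, n_min, n_max):
    """Fraction of the integral of N * N**alpha from n_min to infinity
    that is already accumulated at n_max (requires alpha < -2)."""
    if alpha >= -2.0:
        raise ValueError("the first moment diverges for alpha >= -2")
    return 1.0 - (n_max / n_min) ** (alpha + 2.0)


if __name__ == "__main__":
    # Compare a shallow and a steep power-law between 2e20 and 1e22 cm^-2
    for alpha in (-2.2, -3.0):
        print(alpha, first_moment_fraction(alpha, 2.0e20, 1.0e22))
```

With $\alpha = -2.2$ only a little more than half of the asymptotic value is reached by $N_{\rm max} = 10^{22}\,{\rm cm^{-2}}$, whereas $\alpha = -3$ has essentially converged over the same range.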

Figure 10 shows the cumulative $\Omega_g^{\rm DLA}$ value against
$\log N_{\rm max}$. Both the double power-law and Γ-function converge to the value
indicated by the discrete evaluation of $\Omega_g^{\rm DLA}$ (Equation 17). This adds
confidence that these functional forms are good representations of
$f_{\rm HI}(N,X)$. The single power-law, however, has not converged at the highest
$N_{\rm HI}$ value observed for a damped Lyα system; it implies significantly larger
$\Omega_g^{\rm DLA}$ values than currently supported by the data. Nevertheless, even
this functional form converges formally as $N_{\rm max} \to \infty$:
$\alpha_1 < -2$ at $3\sigma$ significance. The results presented here are the first
definitive demonstration that $\Omega_g^{\rm DLA}$ converges, most likely by
$N_{\rm HI} \approx 10^{22}\,{\rm cm^{-2}}$. We emphasize that smaller redshift
intervals (i.e. subsets of the SDSS-DR3 sample) have insufficient sample size to
convincingly argue for convergence. Therefore, one should view the
$\Omega_g^{\rm DLA}$ values presented as a function of redshift with caution.

Fig. 9. - The solid line plots the maximum likelihood value of the exponent for a
single power-law fit to the $f_{\rm HI}(N,X)$ distribution of the SDSS-DR3_4 sample
as a function of the minimum $N_{\rm HI}$ value of the distribution. The value is
less than $-2$ for all values of $N_{\rm min}$ and the curve indicates the power-law
steepens as one restricts the analysis to the high $N_{\rm HI}$ end of the
distribution function. The dotted curve, meanwhile, plots the exponent value as a
function of the maximum $N_{\rm HI}$ value of the $f_{\rm HI}(N,X)$ distribution.
There is little evolution in this value although the slope is more shallow at the low
$N_{\rm HI}$ region of the distribution.

Fig. 10. - Cumulative evaluation of $\Omega_g^{\rm DLA}$ for the SDSS-DR3_4 sample as
a function of the maximum $N_{\rm HI}$ value. The black curves trace the evaluation
of $\Omega_g^{\rm DLA}$ in the discrete limit (Equation 17) while the smooth curves
show the evaluation by integrating the functional fits to the $f_{\rm HI}(N,X)$
distribution. Both the double power-law and Γ-function converge to the discrete value
by $N_{\rm HI} = 10^{22}\,{\rm cm^{-2}}$. An extrapolation of the single power-law,
however, has not converged even at $N_{\rm HI} = 10^{23}\,{\rm cm^{-2}}$.

Now consider the lower bound to the integrand, $N_{\rm min}$. One evaluates
$\Omega_g^{\rm HI}$ with $N_{\rm min} = 0$, i.e. including the contribution from all
classes of quasar absorption line systems. We have argued that this definition for
$\Omega_g$ lacks special physical significance, yet let us first consider its value.
In particular, we will investigate the contribution of $\Omega_g^{\rm DLA}$ to
$\Omega_g^{\rm HI}$. Tytler (1987) first demonstrated that the H I frequency
distribution of the Lyα forest (absorbers with
$N_{\rm HI} < 10^{17}\,{\rm cm^{-2}}$) is well described by a single power-law with
exponent $\alpha_{\rm Ly\alpha} \approx -1.5$. More recent studies give similar
results over the $N_{\rm HI}$ interval $10^{12} - 10^{15}\,{\rm cm^{-2}}$ (e.g. Hu et
al. 1995; Kirkman & Tytler 1997). This functional form is sufficiently shallow that
the contribution of the Lyα forest to $\Omega_g^{\rm HI}$ is negligible in comparison
to absorbers with $N_{\rm HI} > 10^{15}\,{\rm cm^{-2}}$.

4.3.3. Impact of the Lyman Limit Systems

Of greater interest to this discussion is the contribution of the Lyman limit
systems$^5$ to $\Omega_g^{\rm HI}$. Indeed, Peroux et al. (2003) claimed that Lyman
limit systems with $N_{\rm HI} > 10^{19}\,{\rm cm^{-2}}$ contribute $\approx 50\%$ of
$\Omega_g^{\rm HI}$ at z > 3.5. In PH04, we argued that Peroux et al. (2003)
overstated the contribution of these 'super-LLS' to $\Omega_g^{\rm HI}$ and, in
particular, underestimated the uncertainty of their contribution. Let us return to
this issue with the increased sample of SDSS-DR3.

$^5$ Lyman limit systems are defined to be all absorbers with optical depth exceeding
unity at 1 Ryd. In this discussion, we generally restrict the definition to be
absorbers with $17.2 < \log(N_{\rm HI}/{\rm cm^{-2}}) < 20.3$. The one exception is
the line density, as defined in Equation 19.

The first point to stress is that
$d \log f_{\rm HI}(N,X)/d \log N_{\rm HI} \approx -2$ at
$N_{\rm HI} \approx 10^{20.3}\,{\rm cm^{-2}}$. This is sufficiently steep that
absorbers with $N_{\rm HI} \approx 10^{20}\,{\rm cm^{-2}}$ must contribute at least a
few percent of $\Omega_g^{\rm HI}$. Indeed, if we extrapolate our fits to
$f_{\rm HI}(N,X)$ for the damped Lyα systems from
$N_{\rm HI} = 2 \times 10^{20}\,{\rm cm^{-2}}$ to $10^{17}\,{\rm cm^{-2}}$, we find
the Lyman limit systems would give a larger neutral gas mass density than the value
of the damped Lyα systems. Figure 11 presents the contribution of the Lyman limit
systems to $\Omega_g$ relative to the damped Lyα systems,
$\Omega_g^{\rm LLS}/\Omega_g^{\rm DLA}$. The upper curves show the results for
extrapolations of the single power-law and the Γ-function fits to $f_{\rm HI}(N,X)$
for the damped Lyα systems. For the single power-law, the gas mass density diverges
as $N_{\rm min} \to 0$ because $\alpha_1 < -2$. Even with the Γ-function (with
exponent $\alpha_2 = -1.8$) the Lyman limit systems dominate the contribution to
$\Omega_g^{\rm HI}$.

Fig. 11. - Contribution to $\Omega_g^{\rm HI}$ of the Lyman limit systems relative to
the damped Lyα systems as a function of the minimum $N_{\rm HI}$ value of the Lyman
limit systems. The top two curves refer to extrapolations of the single power-law and
Γ-function fits to $f_{\rm HI}(N,X)$ for the damped Lyα systems. The single power-law
is divergent and even the Γ-function has not converged at
$10^{17.2}\,{\rm cm^{-2}}$. These extrapolations, however, significantly overpredict
the line density of Lyman limit systems and should be considered upper limits to
$\Omega_g^{\rm LLS}/\Omega_g^{\rm DLA}$. The lower curve is the result for a single
power-law fit to $f_{\rm HI}(N,X)$ in the $N_{\rm HI}$ interval for Lyman limit
systems. Because the first derivative of this single power law is not continuous at
the damped Lyα system threshold or the transition to the Lyα forest, it should be
considered a lower limit to $\Omega_g^{\rm LLS}/\Omega_g^{\rm DLA}$.

There is, however, an additional observational constraint which limits
$\Omega_g^{\rm LLS}/\Omega_g^{\rm DLA}$. Although the functional form of
$f_{\rm HI}(N,X)$ is essentially unconstrained for the Lyman limit systems, their
line density,

$$ \ell_{\rm LLS}(X) = \ell(N > 10^{17.2}\,{\rm cm^{-2}}) , \tag{19} $$



has been measured at z > 2 (Peroux et al. 2001). The extrapolations of the single
power-law and Γ-function fits to the damped Lyα frequency distribution overpredict
the observed value of the incidence of Lyman limit systems by more than an order of
magnitude. Therefore, the estimates of $\Omega_g^{\rm LLS}$ for these curves should
be considered unrealistic upper limits.

We can set a lower bound to $\Omega_g^{\rm LLS}$ by assuming $f_{\rm HI}(N,X)$ is a
single power-law over the
$10^{17.2}\,{\rm cm^{-2}} < N_{\rm HI} < 10^{20.3}\,{\rm cm^{-2}}$ interval:

$$ f_{\rm HI}^{\rm LLS}(N,X) = k_{\rm LLS} \left(\frac{N}{10^{20.3}\,{\rm cm^{-2}}}\right)^{\alpha_{\rm LLS}} . \tag{20} $$

The two parameters of this function are determined by demanding

$$ f_{\rm HI}^{\rm LLS}(10^{20.3}\,{\rm cm^{-2}}; X) = f_{\rm HI}^{\rm DLA}(10^{20.3}\,{\rm cm^{-2}}; X) \tag{21} $$

and

$$ \int_{10^{17.2}}^{10^{20.3}} f_{\rm HI}^{\rm LLS}\,dN\,dX = \ell_{\rm LLS}\,dX - \ell_{\rm DLA}\,dX . \tag{22} $$

To estimate $\ell_{\rm LLS}(X)$, we adopt the functional form provided by Peroux et
al. (2003) as a function of redshift, $\ell_{\rm LLS}(z) = 0.07(1 + z)^{2.45}$, and
adopt the mean value for the entire SDSS-DR3 survey, i.e.

$$ \ell_{\rm LLS}(X) = \frac{\int \ell_{\rm LLS}(z)\, g(z)\, dz}{\Delta X} . \tag{23} $$

The uncertainty in $\ell_{\rm LLS}(z)$ implies an approximately 20% uncertainty in
$\ell_{\rm LLS}(X)$. To estimate
$f_{\rm HI}^{\rm DLA}(N = 10^{20.3}\,{\rm cm^{-2}})$ and determine
$\ell_{\rm LLS}$, we adopt the central value of our Γ-function fit. The relative
contribution $\Omega_g^{\rm LLS}/\Omega_g^{\rm DLA}$, of course, is relatively
insensitive to this parameter. For the full SDSS-DR3_4 sample, we find
$\log k_{\rm LLS} = -21.43$ and $\alpha_{\rm LLS} = -1.01$. It is not coincidental
that this slope matches the best-fit values of the 'faint-end' slope for the
Γ-function fits to $f_{\rm HI}(N,X)$ performed by Storrie-Lombardi & Wolfe (2000) and
Peroux et al. (2003). Their maximum likelihood analyses included the integral
constraint imposed by the Lyman limit systems, and the lever arm in $N_{\rm HI}$ of
more than three orders of magnitude forces this result.

Because the LLS power-law is more shallow than the 'faint-end' slope of the fits to
$f_{\rm HI}^{\rm DLA}$, the estimate of $\Omega_g^{\rm LLS}$ based on
$f_{\rm HI}^{\rm LLS}$ must be considered a lower limit. The
$\Omega_g^{\rm LLS}/\Omega_g^{\rm DLA}$ curve for the single power-law fit to
$f_{\rm HI}^{\rm LLS}$ is also shown in Figure 11. We find
$\Omega_g^{\rm LLS} > 0.3\,\Omega_g^{\rm DLA}$ and that the majority of the
contribution comes from absorbers with $N_{\rm HI} > 10^{19}\,{\rm cm^{-2}}$. As
noted above, Peroux et al. (2003) first cautioned that the super-LLS could contribute
significantly to $\Omega_g^{\rm HI}$, especially at very high redshift. In PH04, we
argued that Peroux et al. (2003) had overstated the effect at z > 3.5 and
underestimated the uncertainty. Our new results indicate the super-LLS contribute
from 20 to 50% to $\Omega_g^{\rm HI}$ for the complete SDSS-DR3 survey. Also, recall
that the shape of $f_{\rm HI}(N,X)$ is statistically invariant with redshift
(Figure 7). Therefore, it is likely that the Lyman limit systems offer a
non-negligible contribution to $\Omega_g^{\rm HI}$ at all redshifts. To determine the
value of $\Omega_g^{\rm LLS}$ as a function of redshift, one compares the relative
evolution of $\ell_{\rm LLS}$ and $\ell_{\rm DLA}$. Current observations suggest
$\ell_{\rm LLS}$ has a stronger redshift dependence than $\ell_{\rm DLA}$. In fact,
we do find $\Omega_g^{\rm LLS}$ (as determined from the single power-law fit to
$f_{\rm HI}^{\rm LLS}$) to increase with redshift (Table 9). There is a comparable
increase in $\Omega_g^{\rm DLA}$, however, and the ratio
$\Omega_g^{\rm LLS}/\Omega_g^{\rm DLA}$ is roughly constant in time.

Fig. 12. - Figure depicting the $f_{\rm HI}(N,X)$ distribution of the quasar
absorption line systems at z = 2.7. The distribution for the damped Lyα systems
corresponds to the Γ-function fit to the SDSS-DR3_4 sample. The distribution for the
Lyα forest ($N_{\rm HI} < 10^{15}\,{\rm cm^{-2}}$) is taken from Kirkman & Tytler
(1997) scaled to the ΛCDM cosmology. We show three curves for the Lyman limit system
regime which should only be considered as a sketch: (1) the dotted curve is the
single power-law fit to $f_{\rm HI}^{\rm LLS}$. It is constrained to match the
incidence of Lyman limit systems at $z \approx 2.7$; (2) the dashed curve is a spline
function constrained to match the Lyα forest and damped Lyα systems. It clearly
overpredicts the incidence of Lyman limit systems; and (3) the solid curve is a
spline function fit to the Lyα forest and damped Lyα systems and also constrained by
a single point at $N_{\rm HI} = 10^{17}\,{\rm cm^{-2}}$ defined such that the
integral constraint for the Lyman limit systems is satisfied.
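
The way the two constraints of Equations 21 and 22 fix the LLS power-law can be sketched numerically. Equation 21 sets $k_{\rm LLS}$ equal to the damped Lyα fit evaluated at the threshold, and Equation 22 then determines $\alpha_{\rm LLS}$, found here by bisection. The function names, the bracketing interval, and the use of the quoted best-fit values as a self-consistency check are assumptions of this sketch, not the authors' procedure.

```python
import math

LOG_N_LLS = 17.2  # lower bound of the LLS interval, log10(N / cm^-2)
LOG_N_DLA = 20.3  # damped Lyman-alpha threshold, log10(N / cm^-2)


def lls_integral(log_k, alpha):
    """Integral of k (N / 10^20.3)^alpha dN from 10^17.2 to 10^20.3,
    i.e. the left-hand side of Equation 22 per unit absorption distance."""
    k = 10.0 ** log_k
    n0 = 10.0 ** LOG_N_DLA
    x_lo = 10.0 ** (LOG_N_LLS - LOG_N_DLA)
    if abs(alpha + 1.0) < 1e-12:
        return k * n0 * (-math.log(x_lo))
    return k * n0 * (1.0 - x_lo ** (alpha + 1.0)) / (alpha + 1.0)


def solve_alpha(log_k, target, lo=-3.0, hi=0.0, tol=1e-10):
    """Find alpha such that lls_integral(log_k, alpha) equals `target`.

    The integral decreases monotonically as alpha increases, so plain
    bisection suffices once [lo, hi] brackets the solution.
    """
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if lls_integral(log_k, mid) > target:
            lo = mid  # integral too large: a shallower slope is needed
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Line density implied by the quoted fit, then recover the slope from it
    target = lls_integral(-21.43, -1.01)
    print(target, solve_alpha(-21.43, target))
```

With the quoted values the integral is of order 0.5 per unit absorption distance, which is the difference $\ell_{\rm LLS} - \ell_{\rm DLA}$ that the power-law is required to reproduce.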



We reemphasize that the value of $\Omega_g^{\rm LLS}$ derived from the single
power-law fit to $f_{\rm HI}^{\rm LLS}$ is an underestimate of the true value. In
Figure 12, we show a spline solution which matches the $f_{\rm HI}(N,X)$ functional
forms of the Lyα forest at $N_{\rm HI} = 10^{15}\,{\rm cm^{-2}}$ and the damped Lyα
systems at $N_{\rm HI} = 2 \times 10^{20}\,{\rm cm^{-2}}$ and also gives the correct
line density of Lyman limit systems. We can give an estimate of the LLS contribution
based on this spline function: $\Omega_g^{\rm LLS}/\Omega_g^{\rm DLA} = 0.57$.
Therefore, the likely contribution of the Lyman limit systems to
$\Omega_g^{\rm HI}$ is $\approx 50\%$ of $\Omega_g^{\rm DLA}$ and they comprise
$\approx 1/3$ of $\Omega_g^{\rm HI}$. In § 6, we consider an alternate definition of
the gas mass density: the mass density of gas that is predominantly neutral,
$\Omega_g^{\rm Neut}$. Under this definition, one may disregard the majority of the
Lyman limit systems on the grounds that they are highly ionized (Prochaska & Wolfe
1996; Prochaska 1999; Prochaska & Burles 1999; Dessauges-Zavadsky et al. 2003), but
we suspect that at least a subset with $N_{\rm HI} \approx 10^{20}\,{\rm cm^{-2}}$ is
significantly neutral and could feed star formation. This neutral subset, however,
may contribute only a few percent relative to $\Omega_g^{\rm Neut}$ and we contend
that the damped Lyα systems contain > 80% of $\Omega_g^{\rm Neut}$ at high redshift.
Until a robust determination of $f_{\rm HI}(N,X)$ and the ionization state of the gas
in the Lyman limit systems is made, the only conservative practice is to restrict the
discussion of $\Omega_g^{\rm HI}$ to include only those regions of the universe with
surface density $N_{\rm HI} > 2 \times 10^{20}\,{\rm cm^{-2}}$ (i.e.
$\Omega_g^{\rm DLA}$). For the near future, this is the only quantity that will be
precisely measured.

As an aside, consider the implications of the $\alpha_{\rm LLS}$ value. First, this
value is significantly more shallow than the power-law dependence of the Lyα forest.
Second, it is shallower than the 'faint-end' slope of the damped Lyα
$f_{\rm HI}(N,X)$ distribution. If we demand that $f_{\rm HI}(N,X)$ and its first
derivative are continuous across the Lyα forest/LLS and LLS/DLA boundaries, then our
results indicate that $f_{\rm HI}^{\rm LLS}$ has an inflection with
$d \log f_{\rm HI}^{\rm LLS}/d \log N > -1$ (Figure 12). It is not surprising that
$f_{\rm HI}^{\rm LLS}$ would have an unusual functional form given that it spans the
regime from highly ionized gas to primarily neutral gas. In fact, the inflection
indicated by our analysis resembles the functional forms suggested by Zheng &
Miralda-Escude (2002) and Maller et al. (2003). In passing, we advertise that a
principal goal of the MIKE/HIRES Lyman Limit Survey is to study
$f_{\rm HI}^{\rm LLS}$ and the ionization state of the Lyman limit systems
(Prochaska, Burles, Prochter, O'Meara, Bernstein, & Shectman, in prep.).

4.3.4. $\Omega_g^{\rm DLA}$ Evolution

Let us consider$^6$ the redshift evolution of $\Omega_g^{\rm DLA}$ by evaluating
Equation 17 in a series of redshift intervals. Figure 13 presents the results.
Following PH04, we estimate the uncertainty in $\Omega_g^{\rm DLA}$ through a
modified bootstrap technique. Specifically, we perform 5000 trials where we randomly
draw $p$ damped Lyα systems from the observed sample, where $p$ is a normally
distributed random number with mean equal to the number of damped systems in the
interval ($m_{\rm DLA}$) and with variance also equal to $m_{\rm DLA}$. We calculate
$\Omega_g^{\rm DLA}$ for each trial and calculate the upper and lower values
corresponding to 68.3% of the distribution.
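
The modified bootstrap described above can be sketched as follows. The sketch returns the central 68.3% interval of $\sum N_{\rm HI}$, which converts to $\Omega_g^{\rm DLA}$ through the constant prefactor and $\Delta X$ of Equation 17. Drawing with replacement, rounding $p$ to the nearest non-negative integer, the fixed random seed, and the function name are assumptions of this example.

```python
import random


def bootstrap_sum_interval(n_values, n_trials=5000, seed=0):
    """Central 68.3% interval of the summed column density.

    In each trial the number of systems drawn, p, is a normal deviate with
    mean m and variance m (m = number of observed systems), and p systems
    are drawn with replacement from the observed N_HI values.
    `n_values` must be a non-empty list of linear N_HI values.
    """
    rng = random.Random(seed)
    m = len(n_values)
    sums = []
    for _ in range(n_trials):
        p = max(0, int(round(rng.gauss(m, m ** 0.5))))
        sums.append(sum(rng.choices(n_values, k=p)))
    sums.sort()
    lower = sums[int(0.1585 * n_trials)]
    upper = sums[int(0.8415 * n_trials)]
    return lower, upper


if __name__ == "__main__":
    # Hypothetical sample: 30 absorbers with log N_HI between 20.3 and 21.5
    rng = random.Random(42)
    sample = [10.0 ** rng.uniform(20.3, 21.5) for _ in range(30)]
    print(bootstrap_sum_interval(sample))
```

Letting $p$ vary from trial to trial folds the Poisson uncertainty in the number of absorbers into the interval, in addition to the sampling of the $N_{\rm HI}$ values themselves.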

The results in Figure 13 provide the first statistically significant evidence that
$\Omega_g^{\rm DLA}$ evolves with redshift. Comparing the measurements in the
z = [2.2, 2.5] and z = [3., 3.5] intervals, we find a 50 ± 10% decrease in the lowest
SDSS redshift bin. This evolution occurs during the same time interval when the
decrease in $\ell_{\rm DLA}(X)$ is observed (Figure 8). Regarding the line density,
we have attributed the decline to a decrease in the typical cross-section of the
damped Lyα systems. Given the decrease in $\Omega_g^{\rm DLA}$ it is evident that the
gas is not simply contracting into smaller structures (e.g. via dissipative cooling),
but that the gas is being consumed and/or expelled from the system.

$^6$ The reader should refer to § 5.2 for a discussion of an important systematic
effect related to $\Omega_g^{\rm DLA}$. For the remainder of this paper, we present
results which ignore the effect. Our present concern is that we may be overestimating
$\Omega_g^{\rm DLA}$.

Fig. 13. - Neutral gas mass density of the damped Lyα systems alone as a function of
redshift. There is an increase of approximately a factor of two in
$\Omega_g^{\rm DLA}$ from z = 2 to 3 with the majority of the rise occurring in only
500 Myr. Note the data point at z < 2.2 (marked with a cross) does not include
measurements from the SDSS survey.

Contrary to previous claims, we find no evidence for evolution in
$\Omega_g^{\rm DLA}$ at z > 3.5 although a modest increase (or decrease) is permitted
by the observations. Similarly, the results at $z \approx 2$ from previous work are
consistent (albeit uncomfortably high$^7$) with the z = 2.3 data point. We also
caution the reader that the results at z > 4 should be confirmed by higher resolution
observations. At these redshifts, blending in the Lyα forest is severe and we worry
that the SDSS spectroscopic resolution is insufficient. In passing, we also note that
a significant contribution to $\Omega_g^{\rm DLA}$ at z > 4 comes along the single
sightline to J162626.5+275132 which shows two
$N_{\rm HI} \approx 10^{21}\,{\rm cm^{-2}}$ damped Lyα systems at $z \approx 5$ and
even two additional systems with comparable $N_{\rm HI}$ value at $z \sim 4.5$ which
are not in the statistical sample.

4.4. Summary 

Table 9 presents a summary of $f_{\rm HI}(N,X)$ and its zeroth
and first moments for a series of redshift intervals.
A qualitative summary of the new results is that they are
in broad agreement with the previous analyses based on
substantially smaller datasets (Wolfe et al. 1995;
Storrie-Lombardi & Wolfe 2000; Péroux et al. 2003). With the
increased sample size, however, we have measured evolution
in the incidence and H I content of these galaxies at
z > 2 to unprecedented precision.
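
As a minimal sketch of how the zeroth and first moments summarized above can be estimated from a discrete sample, the Python below uses the usual discrete estimators: the incidence is the number of systems divided by the absorption path ΔX, and the gas mass density is proportional to the summed $N_{\rm HI}$ divided by ΔX. The column densities, the path length, the Hubble constant (70 km/s/Mpc) and the mean molecular weight (μ = 1.3) are illustrative assumptions, not values taken from this survey, and the function names are hypothetical.

```python
import math

# Physical constants in CGS units (rounded standard values).
G_CGS = 6.674e-8        # gravitational constant [cm^3 g^-1 s^-2]
C_CGS = 2.998e10        # speed of light [cm s^-1]
M_H = 1.6726e-24        # mass of a hydrogen atom [g]
CM_PER_MPC = 3.0857e24  # centimetres per megaparsec


def hubble_and_critical_density(h0_km_s_mpc):
    """Return (H0 in s^-1, critical density in g cm^-3)."""
    h0 = h0_km_s_mpc * 1.0e5 / CM_PER_MPC
    rho_c = 3.0 * h0 ** 2 / (8.0 * math.pi * G_CGS)
    return h0, rho_c


def line_density(log_nhi, delta_x):
    """Zeroth moment: number of systems per unit absorption path."""
    return len(log_nhi) / delta_x


def omega_dla(log_nhi, delta_x, h0_km_s_mpc=70.0, mu=1.3):
    """First moment: (mu m_H H0 / c rho_c) * sum(N_HI) / delta_x.

    h0_km_s_mpc and mu are assumed values for illustration only.
    """
    h0, rho_c = hubble_and_critical_density(h0_km_s_mpc)
    total_nhi = sum(10.0 ** x for x in log_nhi)
    return mu * M_H * h0 / (C_CGS * rho_c) * total_nhi / delta_x


# Hypothetical sample: log10 N_HI values and total absorption path.
sample = [20.3, 20.5, 20.8, 21.0, 21.6]
path = 70.0
ell = line_density(sample, path)
omega = omega_dla(sample, path)
```

Because the first moment is a straight sum over $N_{\rm HI}$, the single log N = 21.6 system in this toy sample carries most of the total, which is the same sensitivity to rare high-column systems noted for the low-redshift data point.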



7 Note that the value for $\Omega_g^{\rm DLA}$ is reduced by ≈ 50% if
one removes the single damped system at z = 2.04 toward Q0458−02. This
underscores the sensitivity of these low redshift results to small
number statistics (as stressed by Chen et al. 2005), and the bootstrap
error estimate may be overly optimistic in this case. Coincidentally,
Q0458−02 is an optically variable quasar which in its low state has
a magnitude below the original survey detection limit. Also note
that this redshift interval does not include any data from the SDSS
survey.



Table 9 
SUMMARY 

5. SYSTEMATIC EFFECTS

With the large sample size of the SDSS-DR3, we now
have sufficient signal to examine a number of systematic
effects on the results of our analysis. This section
describes an investigation into a few of these effects. It is
advantageous that the entire SDSS sample was observed
with the same instrument, reduced with the same software,
and analyzed by one group. In contrast, the compilation
of Péroux et al. (2003) is based on many surveys carried
out on more than 10 telescopes and reduced and analyzed by at
least five different individuals. This undoubtedly leads to
at least minor systematic differences between the surveys, e.g.
differences in the continuum placement, systematic effects
related to instrumental resolution, and/or color selection
of the quasar populations. Of course, if multiple groups
report the same value, this lends confidence to the sets of
procedures related to damped Lyα surveys.



5.1. ${\rm SNR}_{\rm lim}$ and Color Selection

All of the results presented in this paper are based on
searches for damped Lyα systems in the spectral regions of
quasars where the median SNR exceeds 4 in a 20 pixel bin
(§ 2). Our experience via visual inspection and Monte
Carlo simulations is that this limit is just satisfactory.
Lowering the limit to ${\rm SNR}_{\rm lim} = 3$ gives many more
false positives and a greater number of missed damped
Lyα systems. Of course, it is important to test the
sensitivity of our results to ${\rm SNR}_{\rm lim}$. Figure 14
presents the (a) line density $\ell_{\rm DLA}(X)$ and (b) neutral
gas mass density $\Omega_g^{\rm DLA}$ relative to the full sample
as a function of redshift for several values of
${\rm SNR}_{\rm lim}$ or the samples restricted to the
strict color criteria (Richards et al. 2002). See Table 2 for
a description of each cut of the SDSS-DR3.

Examining the figure, we find little difference between
the various samples. We find, therefore, no systematic bias
related to ${\rm SNR}_{\rm lim}$ or the color-selection criteria.
In fact, the scatter in the data is smaller than that expected
for a Poissonian distribution. This is because the samples are
not independent. Nevertheless, this analysis demonstrates
that our results are relatively insensitive to
${\rm SNR}_{\rm lim}$ and the color selection criteria.

Fig. 14. — The top panel shows the line density as a function
of redshift for the various sample cuts on ${\rm SNR}_{\rm lim}$
and color selection criteria listed in Table 2. The data are
plotted relative to the SDSS-DR3_4 sample and are offset in
redshift for presentation purposes only. The values are calculated
in the redshift intervals: z = [2.2, 2.5], z = [2.5, 3.],
z = [3., 3.5], z = [3.5, 4.4]. The lower panel shows the results
for $\Omega_g^{\rm DLA}$ for the same cuts and also relative to
SDSS-DR3_4. One notes very little difference between the
various samples.

Fig. 15. — Plot of the quasar magnitude at $\lambda_{\rm rest}$ = 1300
to 1350 Å against the median spectral SNR at $\lambda \approx 1400$ Å.
There is a good correlation between the two quantities although there
are many outliers due primarily to BAL quasars.



5.2. Quasar magnitude and SNR 

There are a number of reasons to examine the results as
a function of the quasar magnitude and/or the SNR of its
spectrum. These include considerations of dust obscuration
(Fall & Pei 1993), the precision of the $N_{\rm HI}$ values as
a function of SNR, and gravitational lensing. We consider
the quasar magnitude and spectral SNR together because
we have found a similar dependence of the results on these
two characteristics. Although there is not a strict one-to-one
relationship between quasar magnitude and SNR for
the SDSS survey, the two characteristics are closely
associated.

We have calculated the magnitude of the quasar as follows.
First, we identify the Sloan filter closest to
$\lambda = (1 + z_{\rm qso}) \times 1325$ Å. Second, we adopt the
PSF magnitude for the quasar as reported in SDSS-DR3. Third, we
flux calibrate the 1D quasar spectrum assuming its relative flux
is accurate. Finally, we calculate the median flux in the
interval $\lambda_{\rm rest}$ = 1300 to 1350 Å (which avoids strong
emission line features) and convert to AB magnitude. To estimate
the SNR of the spectrum, we calculate the median SNR
for $\lambda_{\rm rest}$ = 1440 to 1490 Å where $z_{\rm qso} < 5$
and $\lambda_{\rm rest}$ = 1287 to 1366 Å for $z_{\rm qso} > 5$.
Figure 15 plots the quasar magnitude versus the spectral SNR.
There is a reasonably tight correlation with a few outliers which
are primarily exotic BAL quasars.
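
The last step of this procedure and the SNR estimate can be sketched in Python as follows. This is only an illustration under stated assumptions: the spectrum is taken to be already flux calibrated in $f_\lambda$ (erg s$^{-1}$ cm$^{-2}$ Å$^{-1}$) on an observed-wavelength grid in Å, the filter-matching and rescaling steps are omitted, and the function names are hypothetical. The conversion uses the standard AB definition, $m_{\rm AB} = -2.5 \log_{10} f_\nu - 48.60$ with $f_\nu = f_\lambda \lambda^2 / c$.

```python
import math
from statistics import median

C_ANGSTROM_PER_S = 2.998e18  # speed of light [Angstrom s^-1]


def _window(wave_obs, values, z_qso, rest_lo, rest_hi):
    """Select (wavelength, value) pairs inside a rest-frame window."""
    lo, hi = rest_lo * (1.0 + z_qso), rest_hi * (1.0 + z_qso)
    pairs = [(w, v) for w, v in zip(wave_obs, values) if lo <= w <= hi]
    if not pairs:
        raise ValueError("spectrum does not cover the requested window")
    return pairs


def ab_magnitude(wave_obs, flux_lambda, z_qso):
    """AB magnitude from the median flux at rest 1300 to 1350 Angstrom."""
    pairs = _window(wave_obs, flux_lambda, z_qso, 1300.0, 1350.0)
    f_lam = median(f for _, f in pairs)
    w_mid = median(w for w, _ in pairs)
    f_nu = f_lam * w_mid ** 2 / C_ANGSTROM_PER_S  # erg s^-1 cm^-2 Hz^-1
    return -2.5 * math.log10(f_nu) - 48.60


def median_snr(wave_obs, flux, sigma, z_qso):
    """Median SNR in the rest-frame window used for this redshift.

    The text does not specify z_qso = 5 exactly; it is grouped with the
    high-redshift window here as an arbitrary choice.
    """
    rest = (1440.0, 1490.0) if z_qso < 5.0 else (1287.0, 1366.0)
    snr = [f / s for f, s in zip(flux, sigma)]
    pairs = _window(wave_obs, snr, z_qso, rest[0], rest[1])
    return median(v for _, v in pairs)


# Hypothetical test spectrum: constant f_nu corresponding to m_AB = 19.
wave = [3800.0 + i for i in range(5401)]
f_nu_19 = 3.631e-20 * 10.0 ** (-0.4 * 19.0)
flux = [f_nu_19 * C_ANGSTROM_PER_S / w ** 2 for w in wave]
noise = [f / 10.0 for f in flux]  # constant SNR of 10 by construction
mag = ab_magnitude(wave, flux, z_qso=3.0)
snr = median_snr(wave, flux, noise, z_qso=3.0)
```

Using medians rather than means keeps both quantities robust against narrow absorption features and isolated bad pixels inside the windows.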

Fig. 16. — Line density as a function of redshift for the full sample,
the brightest 33% of the quasars and the faintest 33% of the quasars
in each redshift interval. There is a mild trend toward higher line
density toward the brighter quasars, but the effect is not statistically
significant. Similar results are seen for the quasars cut on spectral
SNR.



The upper panel in Figure 16 presents the line density
of damped Lyα systems as a function of redshift for three
cuts of the SDSS-DR3_4 sample: (i) the complete sample;
(ii) the brightest 33% of the quasars; and (iii) the faintest
33% of the quasars. One notes no significant dependence
of $\ell_{\rm DLA}$ on quasar magnitude. The lower panel shows the
same quantity for cuts on the SNR. Note that the median
magnitudes of the bright and faint samples for the lowest
redshift bin are 18.4 and 19.5 mag, respectively. This
difference in magnitudes is significant and it nearly brackets
the 'break magnitude' of the quasar luminosity function
at z = 2 (Boyle et al. 2000). The results indicate that
if dust obscuration is relevant to damped Lyα surveys,
then the line density is insensitive to its effects. While the
line density may not be very sensitive to dust obscuration,
it is possible that our results violate the predictions of
Fall & Pei (1993). A quantitative statement awaits a full
treatment which includes the SDSS quasar color selection
criteria (Murphy et al., in prep).

Fig. 17. — Gas mass density as a function of redshift for the full
sample, the brightest 33% of the quasars and the faintest 33% of
the quasars in each redshift interval. There is a systematic trend
of higher $\Omega_g^{\rm DLA}$ values toward the brighter quasars.
Although the effect is not significant at the 2σ level in any given
redshift bin, the effect is significant at > 95% c.l. when considering
all bins together. In the text we discuss a number of possible
explanations, the most likely being gravitational lensing.



In contrast, we do find a systematic relationship between
quasar magnitude (and SNR) and $\Omega_g^{\rm DLA}$. Figure 17
presents the results for the same cuts of SDSS-DR3_4 as in
Figure 16. It is evident that the bright sub-sample shows
systematically higher $\Omega_g^{\rm DLA}$ values than the
faint sub-sample. Of course, in order to explain the
independence of $\ell_{\rm DLA}$ from quasar magnitude, all of the
difference in $\Omega_g^{\rm DLA}$ must arise from the frequency
of large $N_{\rm HI}$ damped systems. Indeed, Figure 18 shows that
the $f_{\rm HI}(N,X)$ distributions are comparable for the bright
and faint sub-samples for $N_{\rm HI} < 10^{21}$ cm$^{-2}$ but there
is a factor of 5 difference in the incidence of systems with
$N_{\rm HI} > 10^{21.4}$ cm$^{-2}$. This has the greatest impact on
the $\Omega_g^{\rm DLA}$ results which we now discuss.

Let us consider possible explanations for the results in 
Figures 17 and 18: 

1. Chance (small number statistics): In any given
redshift interval in Figure 17 the results are significant at
only the ≈ 1σ level. Put together, however, the effect
is significant at greater than the 95% c.l.; the weighted
mean of the ratio of the bright to faint sub-samples is
$\Omega_g^{\rm DLA}({\rm bright})/\Omega_g^{\rm DLA}({\rm faint}) = 1.4 \pm 0.2$.
Therefore, we consider it unlikely that the effect is related
only to statistical fluctuations.

Fig. 18. — The H I column density distribution at z = 2.5 to
3 for the full sample, the brightest 33% of the quasars and the
faintest 33%. One observes no statistically significant difference in
$f_{\rm HI}(N,X)$ at low $N_{\rm HI}$ value but that the bright sample
has a 5x higher value at large $N_{\rm HI}$.

Fig. 19. — Histogram of the relative separation of the damped Lyα
absorption redshift from the redshifts defining the survey path of
the background quasar ($z_f$ and $z_i$) as defined by Equation 24. One
notes fewer damped Lyα systems with $z \approx z_i$ as expected because
the incidence of damped Lyα systems increases with redshift.



2. DLA are intrinsic to the quasar: This is a difficult
assertion to disprove. Figure 19 plots a histogram of
the relative separation statistic of the damped Lyα systems
from the quasar

$$ \frac{z_f - z_{\rm DLA}}{z_f - z_i} \qquad (24) $$

restricted to sightlines where $z_f - z_i > 0.2$. While there
are fewer systems with $z_{\rm DLA} \approx z_i$, this is expected
because the incidence of the damped Lyα systems increases with
redshift. At present, we do not consider association with
the quasar to be an important issue.

3. The survey has missed damped Lyα systems
with large $N_{\rm HI}$ toward faint quasars: Given that our
algorithm is most sensitive to spectral regions of very low
SNR, it is very unlikely that we would miss a damped Lyα
system whose core is much larger than the search window
6(1 + z) Å. Furthermore, the profiles with very large
$N_{\rm HI}$ value are the easiest to identify and we would have
recovered most in our visual inspection of the spectra. Finally,
these systems will probably all show significant metal-line
absorption and we would have identified them independently
of the Lyα profile (e.g. § 5.4).

There is, however, one bias related to this point. If a
damped Lyα profile coincides with the wavelength $\lambda_i$ in
the quasar spectrum where the median SNR would have
otherwise exceeded ${\rm SNR}_{\rm lim}$ (see § 2), then the
presence of a damped Lyα system would preclude its inclusion in the
statistical sample (and possibly its discovery altogether).
That is, the presence of a damped Lyα profile at wavelength
$\lambda_{\rm DLA}$, where the median SNR of the unabsorbed
flux exceeds 4, lowers the SNR at $\lambda_{\rm DLA}$ such that
$\lambda_i$ occurs redward of the damped Lyα system. This is true
for damped Lyα systems of all $N_{\rm HI}$ values, although the
effect will be greatest for larger $N_{\rm HI}$. We have accounted
for this bias, in part, by defining $z_i$ to be 1000 km s$^{-1}$
redward of the redshift corresponding to $\lambda_i$. Nevertheless,
this issue is a concern, albeit one we consider minor.

4. Overestimating $N_{\rm HI}$ in the bright sample: We
have reexamined the damped Lyα profiles for every system
with $N_{\rm HI} > 10^{21.4}$ cm$^{-2}$. In no case would we modify
the best $N_{\rm HI}$ value beyond the reported uncertainty.
Furthermore, the bright sample has the highest SNR data and
we consider the $N_{\rm HI}$ values to be well constrained. All
have associated metal-line absorption and reasonably well
determined quasar continua. Therefore, we consider this
explanation to be unimportant.

5. Underestimating $N_{\rm HI}$ in the faint sample: Because
these quasars are faint, the spectra have lower SNR
and more uncertain $N_{\rm HI}$ values. While this could lead
to a significant underestimate of $N_{\rm HI}$, several effects work
against it. First, $f_{\rm HI}(N,X)$ is sufficiently steep
(particularly at high $N_{\rm HI}$) that a Malmquist bias will
actually lead to a flattening of $f_{\rm HI}(N,X)$ at the high
$N_{\rm HI}$ end. This leads to an overestimate of
$\Omega_g^{\rm DLA}$ in the faint subset because the bias will be
more significant for $N_{\rm HI}$ values with larger uncertainty.
Second, the effects of low SNR are most significant at low
$N_{\rm HI}$ values where only a handful of pixels may constrain the
result. But these systems do not contribute significantly to
$\Omega_g^{\rm DLA}$. Third, at large $N_{\rm HI}$ value, the profile
extends over ≈ 100 Å and is generally constrained by tens of pixels.
It is true that uncertainty in the continuum level is greater, but
we do not believe this to be the problem.

To further pursue the possibility that we have underestimated
$N_{\rm HI}$ in the faint sub-sample, we reexamined every
Lyα profile with $N_{\rm HI} > 10^{21}$ cm$^{-2}$ in the z ~ 2.7
redshift bin. While we could allow for 0.1 to 0.2 dex larger values
in a number of cases, this would not account for the differences
in $\Omega_g^{\rm DLA}$. Furthermore, we note that the
$f_{\rm HI}(N,X)$ value in the $N_{\rm HI}$ interval $10^{21}$ to
$10^{21.4}$ cm$^{-2}$ is lower in the faint sub-sample. Therefore,
it would require a systematic underestimate in $N_{\rm HI}$ at nearly
all values to explain the observations. At present, we consider an
underestimate of $N_{\rm HI}$ values in lower SNR data to be a
possible contribution to the differences in $\Omega_g^{\rm DLA}$ but
unlikely to be the main explanation.

Fig. 20. — The (a) Lyα profile and (b) metal-line profiles of the super-LLS at z = 2.42 toward J130634.6+523250. The emission-line feature
at the center of the Lyα profile is due to [O II] emission from a galaxy at z = 0.112 in the SDSS fiber. If this absorber had no significant
metal-line absorption, it would have been dropped from our sample altogether.

6. Dust Obscuration: To date, every damped Lyα
system observed at high resolution has a measured
metallicity exceeding 1/1000 solar (Prochaska et al. 2003b).
With the presence of metals, there is the prospect that
the gas has a non-negligible dust content (e.g. Pettini et
al. 1994). Indeed, damped Lyα systems with metallicity
> 1/10 solar have enhanced Si/Fe ratios which are best
explained by differential depletion (Prochaska 2003). As
Ostriker & Heisler (1984) first noted, the damped Lyα
systems with the highest optical depths of dust will
obscure the background quasar and possibly remove it from
a magnitude limited quasar survey.

The prediction, as formalized by Fall & Pei (1993), is
that optical quasar samples will underestimate
$\Omega_g^{\rm DLA}$ owing to this bias. As Ellison et al. (2004)
have stressed, the effect will be less significant if the quasar
survey extends beyond the peak in the luminosity function. In any
case, this bias implies that the $\Omega_g^{\rm DLA}$ values from
brighter quasars should be lower than the value inferred from faint
quasars. We observe the opposite trend in Figure 17.
Therefore, the results are unlikely to be explained by dust
obscuration.



7. Gravitational Lensing: If the damped Lyα systems
arise in massive halos or disks, then it is possible that
they would gravitationally magnify the background quasar
(Bartelmann & Loeb 1996; Smette, Claeskens & Surdej
1997; Maller, Flores, & Primack 1997). In their analysis of
the SDSS-DR2 quasar database, Murphy & Liske (2004)
reported an approximately 2σ result that the luminosity
function of quasars with foreground damped Lyα systems
is brighter than those without. Unlike dust obscuration,
this systematic effect could explain the results in Figure 17.
In fact, treatments based on describing damped Lyα systems
as exponential disks predict that the effect will be
greatest for damped Lyα systems with large $N_{\rm HI}$ value
(Bartelmann & Loeb 1996; Maller, Flores, & Primack
1997).
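
Returning to explanation 1 in the list above, the way several ≈ 1σ results combine into a > 95% c.l. result can be illustrated with an inverse-variance weighted mean. The per-bin ratios and uncertainties below are hypothetical numbers chosen only so that the combination reproduces the quoted 1.4 ± 0.2; they are not the measured values from Figure 17, and Gaussian, independent errors are assumed.

```python
import math


def weighted_mean(values, sigmas):
    """Inverse-variance weighted mean and its 1-sigma uncertainty."""
    weights = [1.0 / s ** 2 for s in sigmas]
    mean = sum(w * v for w, v in zip(weights, values)) / sum(weights)
    sigma = 1.0 / math.sqrt(sum(weights))
    return mean, sigma


def two_sided_confidence(n_sigma):
    """Gaussian probability of falling within +/- n_sigma of the mean."""
    return math.erf(abs(n_sigma) / math.sqrt(2.0))


# Hypothetical bright/faint ratios in four redshift bins, each only
# about 1 sigma away from a ratio of unity.
ratios = [1.4, 1.5, 1.3, 1.4]
errors = [0.4, 0.4, 0.4, 0.4]

mean, sigma = weighted_mean(ratios, errors)
n_sigma = (mean - 1.0) / sigma          # offset from "no difference"
confidence = two_sided_confidence(n_sigma)
```

With four bins of equal weight the uncertainty on the mean is half the per-bin uncertainty, so an offset of 1σ in each bin becomes a 2σ offset for the ensemble.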

At present, we believe gravitational lensing to be the
most viable explanation for the results in Figures 17 and
18. The key implications for $\Omega_g^{\rm DLA}$ are that (i) our
reported values may be too high and (ii) the evolution of
$\Omega_g^{\rm DLA}$ revealed by the entire sample is qualitatively
correct. Therefore, while this systematic effect is important,
our conclusions are relatively invariant to it. We note,
however, that the effect should be even more pronounced
at z < 2 and may significantly affect damped Lyα statistics
at these redshifts (Rao & Turnshek 2000).

If gravitational lensing does explain the effect, it may
allow for a statistical mass measurement of at least the high
$N_{\rm HI}$ sightlines. It also motivates a high spatial resolution
survey of all damped Lyα systems with $N_{\rm HI} > 10^{21}$ cm$^{-2}$.
In a future paper (Murphy et al. 2005, in prep.) we will
present a full analysis of gravitational lensing and dust
obscuration for the SDSS-DR3 sample.
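
The Malmquist-type bias invoked under explanation 5 above can be illustrated with a toy Monte Carlo. The sketch assumes a single power law $f(N) \propto N^{-3}$ between $10^{20.3}$ and $10^{22.5}$ cm$^{-2}$ and Gaussian errors in $\log N_{\rm HI}$ of 0.1 dex ("bright") and 0.3 dex ("faint"). All of these numbers, and the function names, are hypothetical choices made for illustration; they are not fits to the survey.

```python
import random


def draw_true_nhi(rng, slope, n_min, n_max):
    """Inverse-CDF draw from f(N) proportional to N**slope (slope != -1)."""
    k = slope + 1.0
    u = rng.random()
    return (n_min ** k + u * (n_max ** k - n_min ** k)) ** (1.0 / k)


def mass_ratio_above(threshold, sigma_dex, n_systems=100_000,
                     slope=-3.0, log_min=20.3, log_max=22.5, seed=1):
    """Ratio of summed N_HI above threshold: measured over true values."""
    rng = random.Random(seed)
    n_min, n_max = 10.0 ** log_min, 10.0 ** log_max
    true_sum = 0.0
    obs_sum = 0.0
    for _ in range(n_systems):
        n_true = draw_true_nhi(rng, slope, n_min, n_max)
        # Scatter the measurement by a Gaussian error in log10 N_HI.
        n_obs = n_true * 10.0 ** rng.gauss(0.0, sigma_dex)
        if n_true > threshold:
            true_sum += n_true
        if n_obs > threshold:
            obs_sum += n_obs
    return obs_sum / true_sum


ratio_bright = mass_ratio_above(1.0e21, sigma_dex=0.1)
ratio_faint = mass_ratio_above(1.0e21, sigma_dex=0.3)
```

Because many more low-$N_{\rm HI}$ systems scatter upward than high-$N_{\rm HI}$ systems scatter downward, both ratios exceed unity in this toy model and the excess grows with the measurement error. Under these assumptions, larger errors in the faint sub-sample push its summed column density up rather than down, which is why this effect works against an underestimate in the faint sub-sample.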

5.3. Human Error 

There is a non-negligible likelihood that we have
overlooked a few systems or have made significant errors in
a few select systems. Regarding PH04, for example, we
note the incorrect $N_{\rm HI}$ value for the damped system at
z = 2.77 toward J084407.29+515311 (§ 3.3) and also a
bug in our calculation of $\ell(z)$ for the SDSS-DR1 sample.
To quickly disseminate corrected and updated results
to the community, we have established a public web
site where all of the fits and analysis will be presented
(http://www.ucolick.org/~xavier/SDSSDLA/index.html).
We encourage the community to report any mistakes with
our analysis to the lead author via the email address sdss- 
dla@ucolick.org. 

5.4. Unusual Systematic Effects: "Things that go bump 
in the night" 

With a sample of damped Lyα systems approaching
1000, it is not surprising that unexpected systematic errors
will arise. Figure 20 presents the Lyα profile and metal-line
profiles for the damped Lyα candidate at z = 2.42
toward J130634.6+523250. This damped Lyα candidate
was not identified by our automated algorithm because of
significant flux at the center of the Lyα profile. Instead,
the system was identified because of its metal-line absorption
and we immediately hypothesized that the flux in the
Lyα profile was due to Lyα emission from the host galaxy.
The emission line would be amazingly strong, however,
and we considered alternate explanations. In due time, we
realized that the feature is an emission line: [O II] emission
from a z = 0.116 galaxy which lies within the 3" SDSS
fiber. Emission lines of Hα, Hβ, and [O III] are also apparent
in the quasar spectrum. Ignoring the [O II] emission,
we have fit the Lyα profile and its central value places it
beneath the statistical threshold for damped Lyα systems.
Nevertheless, this is a systematic effect which leads to an
underestimate of $f_{\rm HI}(N,X)$ at all $N_{\rm HI}$ values. It is
difficult to quantify the overall effect here, but it is presumably
less than 1%.

6. DISCUSSION AND SPECULATIONS 

The emphasis of this paper is to describe the results of
the damped Lyα survey of the SDSS-DR3 quasar database.
These results and a discussion of the systematic errors
were presented in the previous sections. We now consider
a few of the implications with emphasis on the new results.
We also compare the observations against theoretical
treatments of the damped Lyα systems within ΛCDM
models of galaxy formation. We consider the results from
the smooth particle hydrodynamic (SPH) simulations of
Nagamine, Springel, & Hernquist (2004), the Eulerian
simulations of Cen et al. (2003), and the semi-analytic
model (SAM) of the Santa Cruz group (Somerville, Primack,
& Faber 2001; Maller et al. 2001, 2003). It is
important to stress that each model includes its own set
of star formation and feedback recipes which do bear on
the results for the damped Lyα systems.



Consider first the $N_{\rm HI}$ frequency distribution,
$f_{\rm HI}(N,X)$. Perhaps the most remarkable result from the
SDSS-DR3 sample is that there is no statistical evidence for any
evolution in the shape of $f_{\rm HI}(N,X)$ with redshift (Figure 7).
There is, however, evidence for evolution in the normalization
of $f_{\rm HI}(N,X)$ as traced by the trends in the zeroth and
first moments of the distribution function. These results
suggest that the gas distribution within galaxies is similar
at all redshifts and that only the number and/or sizes
of these galaxies evolve significantly. Another interesting
result is that the faint-end slope of the $f_{\rm HI}(N,X)$
distribution is $\alpha_3 \approx -1.8$. This slope matches the
faint-end slope of the dark matter halo mass function for CDM (e.g.
Sheth, Mo, & Tormen 2001). If this is not a coincidence,
it indicates that low mass halos dominate the incidence of
damped Lyα systems at low $N_{\rm HI}$ values. Furthermore, it
suggests that the cross-section A(X) of low mass galaxies
is nearly independent of mass. At present, however, we
consider the correspondence to be a coincidence.



[Figure 21: horizontal axis is $\log_{10} N_{\rm HI}$, spanning 20.5 to 22.0.]

Fig. 21. — The H I frequency distribution of the damped Lyα
systems at z = 3 (top; specifically z = 2.8 to 3.2) and z = 4 (bottom;
specifically z = 3.7 to 4.3) compared against the theoretical curves of
Maller et al. (2001) (SAMS; lighter curve) and Nagamine, Springel,
& Hernquist (2004) (SPH; darker curves; dashed is the D5 model
and solid is the Q5 model).



A comparison of the results against ΛCDM models of
galaxy formation is presented in Figure 21 at z = 3 and
z = 4. The $f_{\rm HI}(N,X)$ curves are for the SPH simulations
of Nagamine, Springel, & Hernquist (2004) and the
SAM model of Maller et al. (2001). The SAM model
shows a reasonable match to the shape of $f_{\rm HI}(N,X)$ at
z = 3, yet systematically underpredicts the observations.
Maller et al. (2001) were primarily interested in modeling
the kinematics of the damped Lyα systems and we
suspect their results are insensitive to the normalization
of $f_{\rm HI}(N,X)$. Nevertheless, the discrepancy suggests that
the SAM model has too few damped Lyα systems at z = 3.
Similarly the SPH theoretical curves offer a reasonable
match to the observations at large $N_{\rm HI}$ value yet
significantly underpredict $f_{\rm HI}(N,X)$ at low $N_{\rm HI}$
values.

The discrepancy has several important implications. First,
the SPH simulations will predict a significantly smaller
contribution to $\Omega_g^{\rm HI}$ for the Lyman limit systems at
these redshifts. In part, this is a restatement of a current
problem in numerical cosmology: the simulations underpredict
the incidence of Lyman limit systems by an order of magnitude
(Gardner et al. 2001). Second, we reemphasize the
critical constraint imposed by two sets of observations of
the damped Lyα systems: (a) $f_{\rm HI}(N,X)$ and (b) the velocity
width distribution of the low-ion profiles (Prochaska
& Wolfe 1997). Jedamzik & Prochaska (1998) first emphasized
that there is a tension between $f_{\rm HI}(N,X)$ and
the velocity width distribution. Specifically, if one introduces
enough low mass halos to match the low $N_{\rm HI}$ end
of $f_{\rm HI}(N,X)$, this implies far too many damped Lyα systems
with small velocity width. Therefore, if the SPH
simulations were modified to match the $f_{\rm HI}(N,X)$
observations (e.g. via different treatments of feedback or
radiative transfer), we predict that (i) the dependence of
the cross-section A(X) on the halo mass as reported by
Nagamine, Springel, & Hernquist (2004) would change
significantly and (ii) the simulations would not reproduce
the damped Lyα kinematics. We also note that the results
of Nagamine, Springel, & Hernquist (2004) show
that $f_{\rm HI}(N,X)$ is relatively insensitive to treatments of
feedback. It is possible that $f_{\rm HI}(N,X)$ and the damped
Lyα kinematics present a fundamental challenge to scenarios
of galaxy formation within the ΛCDM cosmology.

Now consider the evolution in the line density of damped
Lyα systems. Because $\ell_{\rm DLA}(X)$ is the product of the
comoving number density and cross-section of damped Lyα
systems (Equation 14), the rise in $\ell_{\rm DLA}(X)$ reflects an
increase in one or both of these quantities. We can
estimate the evolution in $n_{\rm DLA}$ by considering the
Press-Schechter formalism for the mass function of dark matter
halos (e.g. Peacock 1999). Within this formalism one can
define a mass scale $M_*$ as a function of redshift which
identifies the typical mass of assembly. At z = 3 in the
ΛCDM cosmology with $\sigma_8 = 1$, $M_* \approx 10^{10} M_\odot$.
Halos with masses $M \gg M_*$ will have a number density
which increases significantly from z = 3 to 2. Therefore,
damped Lyα systems are unlikely to arise in halos
with $M \gg 10^{10} M_\odot$ at these redshifts. Furthermore,
although halos with $M < M_*$ do have a decreasing comoving
number density, the decrease is small (i.e. < 15%
for $M = 10^9 M_\odot$). Within the context of the hierarchical
cosmology, therefore, the most likely explanation
for the decrease in $\ell_{\rm DLA}(X)$ is a corresponding decline
in A(X). There are several physical effects which could
reduce A(X), e.g., star formation, AGN feedback,
photoionization, and galactic winds. Current estimates of the
star formation rates are relatively uncertain over this redshift
range (e.g. Wolfe, Gawiser, & Prochaska 2003) but
probably cannot account for the bulk of evolution. Similarly,
the intensity of the extragalactic background radiation
field is believed to be roughly constant (e.g. Haardt &
Madau 1996) and photoionization effects should be minor.
Therefore, we contend that one or more feedback mechanisms
have significantly reduced the typical cross-section
of galaxies to damped Lyα absorption from z = 3 to 2.
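
The bookkeeping behind this argument is simple enough to sketch. Taking the line density as the product of comoving number density and cross-section, and adopting the 35% decline in incidence quoted in the abstract together with the < 15% decline in halo number density quoted above, the implied change in cross-section follows directly. The sketch ignores the measurement uncertainties, and the function name is hypothetical.

```python
def cross_section_ratio(line_density_ratio, number_density_ratio):
    """Ratio A(later)/A(earlier), given ell = n * A at both epochs."""
    return line_density_ratio / number_density_ratio


# Incidence falls by 35% from z = [3., 3.5] to z = [2.2, 2.5].
ell_ratio = 1.0 - 0.35

# Bracketing cases for the comoving number density of low-mass halos:
# no change at all, or the largest quoted decline (15%).
ratio_if_n_constant = cross_section_ratio(ell_ratio, 1.0)
ratio_if_n_drops_15 = cross_section_ratio(ell_ratio, 1.0 - 0.15)

decline_max = 1.0 - ratio_if_n_constant   # 35% decline in A(X)
decline_min = 1.0 - ratio_if_n_drops_15   # roughly 24% decline in A(X)
```

Even in the case most favorable to number-density evolution, most of the observed drop in incidence has to come from the cross-section.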

Turning our attention to the neutral gas mass density,
we present the damped Lyα observations in Figure 22,
including recent results at z < 1.6 from Rao, Turnshek, &
Nestor (in prep.), compared against several ΛCDM models,
and also current estimates of the mass density of stars
(star), neutral gas (diamond), and Irr galaxies (+ sign) at
z ~ 0. It is important to note that the theoretical models
of Somerville et al. and Nagamine et al. include contributions
to $\Omega_g$ from all quasar absorption line systems (i.e.
they calculate $\Omega_g^{\rm HI}$), whereas the observational
measurements are restricted to the damped Lyα systems. Therefore,
if the Lyman limit systems do contribute significantly
to $\Omega_g^{\rm HI}$, we must increment the observations
accordingly. Alternatively, we recommend that future theoretical
analysis be restricted to sightlines with
$N_{\rm HI} > 2 \times 10^{20}$ cm$^{-2}$ (e.g. Cen et al. 2003).

Examining the damped Lyα observations alone, we note
a relatively confusing picture. While the results based
primarily on the SDSS-DR3 observations (z > 2.2) show a
well behaved trend with redshift, the estimates of
$\Omega_g^{\rm DLA}$ at z < 2 are all consistent with one another
with a central value higher than the z = 2.3 measurement. While each
individual measurement at z < 2 is consistent with the
SDSS data point at z = 2.3, taken together the difference
is significant at > 95% c.l. In fact, if one were to ignore the
redshift interval at z = 2.3, the observations are consistent
with no evolution in $\Omega_g^{\rm DLA}$ from z = 0.1 to z = 4.5.
Before reaching such a conclusion, however, we wish to emphasize
several points: (1) the value in the z = [2.2, 2.5] interval
is very well determined because it is based on ~ 100
damped Lyα systems; (2) the z ~ 2 data point is derived
from a heterogeneous sample of observations and is dominated
by a single damped Lyα system; (3) the low redshift
values are based on the novel yet non-standard technique
of Rao & Turnshek (2000). Their approach has its own
set of systematic errors which are uniquely different from
the damped Lyα survey described in this paper; and (4)
we argued in § 5.2 that the $\Omega_g^{\rm DLA}$ results may be
biased by gravitational lensing. If this is confirmed, the effect
should be largest at z < 2. These points aside, it is clear
that achieving better than 10% precision on $\Omega_g^{\rm DLA}$
at z < 2 is a critical goal of future damped Lyα surveys. At z ~ 2,
this will require a large observing campaign with a spectrometer
efficient down to 3200 Å. At lower redshift, one
will require a new UV space observatory.

Comparing the models, we note a wide range of predictions.
The most successful models at z > 2 are from
Nagamine, Springel, & Hernquist (2004), in particular
their D5 run. This model reproduces both the shape and
normalization of the observed data. In contrast, the Eulerian
and SAM models overpredict $\Omega_g^{\rm DLA}$ at all redshifts
and at z < 3 respectively, even if one adopts a 1.5 multiplicative
correction due to the Lyman limit systems. Because
the cooling processes and time-scales are comparable
in all of the models (e.g. Pearce et al. 2001), the differences
must be due to processes which consume or ionize the
neutral gas (e.g. star formation, galactic winds, AGN feedback).
The indication from our observations is that the
Eulerian and SAM models underpredict these processes at
z > 2 and therefore overestimate $\Omega_g$.

Fig. 22. — The gas mass density of neutral gas for the damped Lyα systems from our analysis (z > 1.5 with SDSS restricted to z > 2.2)
and recent results (purple) from Rao, Turnshek, & Nestor (in prep.). These observations are compared against the theoretical curves of Cen
et al. (2003) (EULER; green), Somerville, Primack, & Faber (2001) (SAMS; red), and Nagamine, Springel, & Hernquist (2004) (SPH; blue;
dotted is the D5 model and solid is the Q5 model). The data points at z = 0 correspond to the stellar mass density from Cole et al. (2001)
(star), the neutral gas mass density (diamond), and the mass density of Irr galaxies (+-sign; Fukugita et al. 1998).

Now consider a comparison of the high-z $\Omega_g^{\rm DLA}$ values
with the mass density at z = 0 of stars $\Omega_*$, neutral gas
$\Omega_g^{\rm 21cm}$, and dwarf galaxies. The stellar mass density
was estimated by Cole et al. (2001) from an analysis of
the 2dF survey. The uncertainty in this estimate is dominated
by systematic error related to the assumed initial
mass function. Wolfe et al. (1995) first stressed that the
gas mass density of the damped Lyα systems is comparable
to $\Omega_*$. Adopting the ΛCDM cosmology and the current
estimate of $\Omega_*$, we now find that $\Omega_*$ exceeds
$\Omega_g^{\rm DLA}$ by a factor of two to three at z = 3. Because
star formation is ongoing at all redshifts probed by the damped
Lyα systems (e.g. Chen & Lanzetta 2003; Wolfe, Prochaska, &
Gawiser 2003; Møller et al. 2002), however, it would be
wrong to interpret the maximum $\Omega_g^{\rm DLA}$ value as the
total gas mass density contributed by the damped Lyα systems.
It is more accurate to regard damped Lyα systems as neutral
gas reservoirs in which gas consumed by star formation
is replaced by neutral gas accreted from the IGM. In this
manner, the mass density of damped Lyα systems would
be less than $\Omega_*$ at any given epoch. Therefore, it is
reasonable to assume that all of the stars observed today arose
from gas originating in the damped Lyα systems. Indeed,
this is the generic conclusion of current cosmological
simulations. These points aside, it is evident that the damped
Lyα systems contain sufficient gas mass to account for all
of the stars observed in disks today given current estimates
($\Omega_{\rm disk} \approx \Omega_*/3$; Fukugita et al. 1998).

Examining Figure 22, we note that the difference between
$\Omega_g^{\rm 21cm}$ at $z = 0$ and $\Omega_g^{\rm DLA}$ at $z = 2.3$
is $0.15 \pm 0.08$, i.e. consistent with very little evolution if one
ignores the results at $z \sim 1$ we present from Rao et al. (in
prep). Is this a remarkable coincidence or is there a physical
explanation (e.g. the gas accretion rate equaled the star formation
rate over the past 10 Gyr)? Before addressing this question, consider
the determination of $\Omega_g^{\rm 21cm}$. The value is derived from
large area surveys of 21cm emission-line observations for H I
'clouds'. The most recent results are from the HIPASS survey as
analyzed by Zwaan et al. (2005). The analysis proceeds by fitting a
functional form (a Γ-function) to the H I mass distribution of all
galaxies detected. The $\Omega_g^{\rm 21cm}$ value is simply
proportional to the first moment of this distribution function. The
key point to emphasize is that the analysis includes all H I gas
within the beam of HIPASS, i.e., the values are independent of H I
surface density and are not restricted to the damped Lyα threshold.
In the extreme case that the H I gas is predominantly distributed in
Mestel disks with $\Sigma(R) \propto R^{-1}$, the contribution to
$\Omega_g^{\rm 21cm}$ from damped Lyα systems is
$\Omega_g^{\rm DLA}/\Omega_g^{\rm 21cm} = \Sigma_{\rm trunc}/\Sigma_{\rm DLA} = N_{\rm trunc}/10^{20.3}\,{\rm cm^{-2}}$
where $\Sigma_{\rm trunc}$ and $N_{\rm trunc}$ are the surface density and H I column
density at the truncation radius of the Mestel disk and
$\Sigma_{\rm DLA}$ is the surface density at the damped Lyα threshold.
One may adopt a truncation radius for the Mestel disk according to
the photoionization edge set by the intensity of the extragalactic UV
background (EUVB) or observational constraints (e.g. Maloney 1993).
Adopting a relatively conservative value of
$N_{\rm trunc} = 10^{19.5}\,{\rm cm^{-2}}$, we find
$\Omega_g^{\rm DLA} = 0.15\,\Omega_g^{\rm 21cm}$. Of course, this is a
somewhat extreme model because many gas disks have exponential
profiles. For an exponential disk, the value of
$\Omega_g^{\rm DLA}/\Omega_g^{\rm 21cm}$ is sensitive to the ratio of
the radii corresponding to the damped Lyα threshold and the
exponential scale-length. Taking $R_{\rm DLA}/R_{\rm exp} = 2$, we find
$\Omega_g^{\rm DLA} = 0.5\,\Omega_g^{\rm 21cm}$. These simple estimates
imply that a significant fraction of $\Omega_g^{\rm 21cm}$ at $z = 0$
could come from Lyman limit systems.
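
The two disk estimates above can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not the calculation performed for this paper: the function names are hypothetical, the disks are assumed to be face-on and axisymmetric, and the exponential disk is assumed to be untruncated. For a Mestel disk the enclosed mass grows linearly with radius, so the mass fraction inside the damped Lyα radius equals $N_{\rm trunc}/N_{\rm DLA}$; for an exponential disk the enclosed mass fraction within $x$ scale-lengths is $1 - (1 + x)e^{-x}$.

```python
import math

# log10 of the damped Lya threshold column density in cm^-2 (2 x 10^20)
LOG_N_DLA = 20.3


def mestel_dla_fraction(log_n_trunc, log_n_dla=LOG_N_DLA):
    """H I mass fraction above the DLA threshold for a Mestel disk.

    With Sigma(R) proportional to 1/R the enclosed mass scales as R, so
    M(<R_DLA) / M(<R_trunc) = R_DLA / R_trunc = N_trunc / N_DLA.
    Assumes a face-on disk truncated sharply at N_trunc (illustrative).
    """
    return 10.0 ** (log_n_trunc - log_n_dla)


def exponential_dla_fraction(r_ratio):
    """H I mass fraction inside r_ratio = R_DLA / R_exp scale-lengths.

    Assumes an untruncated, face-on exponential disk, for which the
    enclosed mass fraction is 1 - (1 + x) * exp(-x).
    """
    return 1.0 - (1.0 + r_ratio) * math.exp(-r_ratio)


if __name__ == "__main__":
    print(f"Mestel, N_trunc = 10^19.5: {mestel_dla_fraction(19.5):.2f}")
    print(f"Exponential, R_DLA/R_exp = 2: {exponential_dla_fraction(2.0):.2f}")
```

Under these assumptions the Mestel case gives about 0.16, in line with the 0.15 quoted above. The untruncated exponential case gives about 0.59, somewhat above the quoted 0.5; this simplified sketch makes no attempt to reproduce whatever truncation or rounding enters that figure.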

One can also place an empirical constraint on
$\Omega_g^{\rm DLA}/\Omega_g^{\rm 21cm}$ from the H I distribution
function at $z = 0$ as measured by Ryan-Weber, Webster, &
Staveley-Smith (2003). If one (a) adopts their double power-law fit
to $f_{\rm HI}(N, X)$ with exponents $\alpha_3 = -1.4$ and
$\alpha_4 = -2.1$ and $N_d = 10^{20.9}\,{\rm cm^{-2}}$, (b)
extrapolates this frequency distribution to
$N_{\rm min} = 10^{19}\,{\rm cm^{-2}}$, and (c) integrates to
$N_{\rm max} = 10^{21.6}\,{\rm cm^{-2}}$ (the largest $N_{\rm HI}$ value
measured), we find $\Omega_g^{\rm DLA} = 0.6\,\Omega_g^{\rm 21cm}$.
There is significant uncertainty and degeneracy in the fitted
parameters of $f_{\rm HI}(N, X)$, however, and
$\Omega_g^{\rm DLA}/\Omega_g^{\rm 21cm}$ could be significantly higher
or lower. Nevertheless, it is our present view$^8$ that
$\Omega_g^{\rm DLA}$ at $z = 0$ is $\approx 0.5\,\Omega_g^{\rm 21cm}$.
Therefore, we argue that a strict comparison of $\Omega_g^{\rm DLA}$
at $z = 0$ and $z = 2$ could imply a decrease in $\Omega_g^{\rm DLA}$
over the past 10 Gyr.

Given the uncertainties inherent to $\Omega_g^{\rm 21cm}$ and the fact
that it has an ambiguous physical meaning, we now advocate an
alternate definition for $\Omega_g$ motivated by star formation
processes. Define $\Omega_g^{\rm Neut}$ to be the mass density of gas
that is predominantly neutral. Under this definition,
$\Omega_g^{\rm Neut}$ evolves as follows. The decrease of the EUVB
intensity with decreasing redshift implies that lower surface
densities will be neutral at lower redshift. Therefore, it is very
possible that $\Omega_g^{\rm Neut}$ equals $\Omega_g^{\rm DLA}$ at
$z = 2$ and the HIPASS value at $z = 0$. In this case, the gas
reservoir available to star formation may have remained nearly
constant over the past 10 Gyr (subject, of course, to the value of
$\Omega_g^{\rm Neut}$ from $z = 0$ to 2). We emphasize that this is
consistent with the presence of star formation provided gas
replenishment from the IGM is available. Future progress on these
issues will require an accurate determination of $f_{\rm HI}(N, X)$ at
all redshifts and also an accurate assessment of the ionization state
of the gas as a function of H I column density.

Finally, compare $\Omega_g^{\rm DLA}$ at $z = 3$ against the mass
density in dwarf galaxies $\Omega_{\rm dw}$ as derived by Fukugita et
al. (1998). The value presented here is their estimate for the
stellar mass density in Irr galaxies boosted by a factor of 2 to
account for the gas mass and by 30% for
$H_0 = 75\,{\rm km\,s^{-1}\,Mpc^{-1}}$. We emphasize that the estimate
of $\Omega_{\rm dw}$ is difficult to make and may be more uncertain
than depicted here. Nevertheless, it is evident that the mass density
of dwarf galaxies is roughly $10\times$ smaller than
$\Omega_g^{\rm DLA}$ at $z = 3$. A number of authors have argued that
the majority of damped Lyα systems will evolve into dwarf galaxies
today. It is obvious, however, that the majority of damped Lyα
systems (by mass) cannot evolve into dwarf galaxies. And because
damped Lyα systems with all values of $N_{\rm HI}$ contribute to
$\Omega_g^{\rm DLA}$ (Figure 10), we argue that the damped Lyα systems
by number are not the progenitors of dwarf galaxies. As we have
argued previously, it is premature to interpret the observed
abundance patterns as significant evidence for a link between damped
Lyα systems and dwarf galaxies (Prochaska 2003). Furthermore, the
kinematic characteristics of the damped Lyα systems preclude such an
interpretation (Prochaska & Wolfe 1997; Prochaska, Ryan-Weber, &
Staveley-Smith 2002). Revealing the true nature of the damped Lyα
systems, of course, will require detailed follow-up studies (high
resolution spectroscopy, deep imaging, etc.) of the galaxies
identified in this survey.

8 Since submission of our paper, we have been informed that a
better estimate is 15% (Zwaan, priv. comm.).
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